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Project  Overview 

This  project  focuses  on  developing  algorithms  for  optimization  problems  that  have  intrinsic 
limitations  preventing  the  utilization  of  all  available  decision  alternatives  (problem  variables) 
and/or  the  satisfaction  of  all  constraints.  Part  of  the  optimization  decision  in  these  problems  is 
the  selection  of  which  variables  to  use  and/or  which  subset  of  constraints  to  satisfy.  We  refer  to 
these  problems  as  selective  optimization  (SO)  problems.  A  typical  example  is  the  partial 
coverage  problem  that  seeks  levels  of  resources  (facilities,  sensors,  transmission  locations)  to 
cover  or  serve  a  set  of  specified  tasks  (customers,  targets,  receivers)  while  minimizing  resource 
usage  costs.  In  many  settings,  full  coverage  is  economically  or  physically  impossible,  and  a 
typical  goal  is  to  achieve  partial  (say  95%)  coverage,  i.e.  only  a  subset  of  the  covering 
constraints  need  to  be  satisfied.  Applications  of  selective  optimization  arise  in  numerous  diverse 
areas  ranging  from  defense  to  medicine. 

The  combinatorial  aspects  of  selection  make  these  problems  extremely  difficult.  In  this  project 
we  develop  a  set  of  generic  tools  applicable  to  a  wide  class  of  selective  optimization  problems. 
Our  approach  is  based  on  standard  mixed-integer  programming  (MIP)  formulations  of  selective 
optimization  problems.  While  such  formulations  can  be  attacked  by  commercial  optimization 
solvers,  they  typically  exhibit  extremely  poor  performance.  We  develop  a  variety  of  effective 
model  and  algorithm  enhancement  techniques  for  the  standard  MIP  formulations.  These 
techniques  are  easily  integrable  into  commercial  MIP  solvers,  thereby  making  them  readily 
usable  in  applications  of  selective  optimization. 

In  the  following  we  describe  our  research  activities,  key  results,  and  publications.  Details  of  the 
results  appear  in  the  papers  appended  at  the  end  of  this  report. 

Performance  period  4/2012-4/2013 

The  first  year  of  the  project  was  devoted  to  the  following  three  activities: 

1.  Partial  covering  problems  where  a  subset  of  the  constraints  is  required  to  be  satisfied: 
We  analyzed  the  complexity  of  this  class  of  problems,  and  developed  strengthened 
formulations  and  algorithmic  techniques  which  perform  significantly  better  than  standard 
MIP  approaches.  A  paper  on  this  work  has  been  published. 

2.  Transportation  problems  where  a  subset  of  the  demands  is  to  be  satisfied:  When 
resources  are  constrained,  one  may  be  able  to  satisfy  only  a  subset  of  demands,  rather 
than  all  the  demands.  Therefore  the  transportation  problem  now  includes  two  sets  of 
decisions:  the  decision  of  which  demands  to  satisfy  and  a  decision  of  how  to  satisfy 
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these  demands.  We  study  the  complexity  of  this  problem  and  present  cases  where  the 
problem  is  tractable.  For  all  other  cases,  we  developed  cutting-plane  based  techniques 
to  improve  the  efficacy  of  a  general  purpose  IP  solver  solving  this  problem. 

3.  Scenario  decomposition  of  stochastic  combinatorial  problems:  We  developed  a  new 
decomposition  algorithm  for  stochastic  0-1  problems  based  on  selecting  solutions  from 
the  set  of  single  scenario  solutions,  and  refining  this  set  iteratively.  The  algorithm  is  very 
easily  parallelizable  and  has  the  best  computational  performance  to-date  on  various 
benchmark  problems.  A  paper  on  this  work  has  been  published. 

The  project  provided  partial  support  for  PhD  students  Feng  Qiu  and  Gustavo  Angulo.  Feng  Qiu 
graduated  in  2013  and  is  currently  a  postdoc  in  Argonne  national  labs.  One  month  for  each  of 
the  two  Pis  was  also  supported.  The  projects  funds  were  also  used  for  travel  to  various 
conferences  (INFORMS,  MIP2013,  ICSP2013)  for  dissemination  of  research  results. 

The  following  publications  resulted  from  the  2012-2013  activities: 

1.  F.  Qiu,  S.  Ahmed,  S.S.  Dey,  L.  Wolsey.  “Covering  linear  programming  with  violations,” 
INFORMS  Journal  on  Computing,  vol.  26,  pp.  531-546,  2014. 

2.  P.  Damci-Kurt,  S.S.  Dey,  S.  Kucukyavuz.  "On  the  transportation  problem  with  market 
choice”  Discrete  Applied  Mathematics,  vol.  181,  pp. 54-77,  2015. 

3.  S.  Ahmed.  “A  scenario  decomposition  algorithm  for  0-1  stochastic  programs,” 
Operations  Research  Letters,  vol.  41,  pp.  565-569,  2013. 

Performance  period  4/2013-4/2014: 

The  second  year  of  the  project  was  devoted  to  the  following  three  activities: 

1.  Strengthening  the  bounds  for  estimating  the  probability  of  k-out-of-n  events:  Given  a  set 
of  n  random  events,  represented  by  n  Bernoulli  variables,  we  consider  the  computation 
of  bounds  on  the  probability  that  k  out  of  n  events  occur  when  partial  distribution 
information  is  available.  Upper  or  lower  bound  can  be  computed  for  this  probability  using 
a  linear  program.  We  designed  inequalities  that  can  be  added  to  this  linear  program  to 
significantly  strengthen  these  bounds.  The  bounding  approach  is  very  useful  in  deriving 
relaxations  and  restrictions  of  probabilistic  set  covering  problems. 

2.  Forbidding  vertices  of  a  polytope:  Given  a  polytope  P  and  a  subset  X  of  its  vertices,  we 
study  the  complexity  of  linear  optimization  over  the  subset  of  vertices  of  P  that  are  not 
contained  in  X.  This  problem  is  closely  related  to  finding  the  k  best  basic  solutions  to  a 
linear  problem.  We  show  that  the  complexity  of  the  problem  changes  significantly 
depending  on  the  encoding  of  both  P  and  X.  Using  these  results  we  show  that  optimizing 
on  the  binary  all  different  polytope  can  be  accomplished  in  polynomial-time. 

3.  Improved  integer  L-shaped  method:  The  Integer  L-shaped  method  is  the  state-of-the-art 
algorithm  for  solving  two-stage  stochastic  programs  with  integer  recourse.  We  develop 
two  enhancements  to  this  algorithm  to  significantly  improve  its  performance.  The  first 
approach  relies  on  carefully  alternating  between  solving  integer  and  linear  subproblems, 
and  the  second  approach  uses  our  results  from  forbidding  vertices  of  a  polytope  (item  2 
above)  to  strengthen  integer  L-shaped  cuts. 
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The  project  provided  partial  support  for  PhD  students  Gustavo  Angulo  and  Kevin  Ryan.  Gustavo 
Angulo  graduated  in  Summer  2014.  He  is  currently  a  postdoctoral  researcher  at  CORE,  Belgium 
for  a  year  and  will  bee  joining  as  a  faculty  at  Pontificia  Universidad  Catolica  de  Chile.  One 
month  for  each  of  the  two  Pis  was  also  supported. 

The  following  publications  resulted  from  the  2013-2014  activities: 

1.  F.  Qiu,  S.  Ahmed,  S.S.  Dey.  “Strengthened  bounds  for  probability  of  k-out-of-n  events,” 
accepted  for  publication  in  Discrete  Applied  Math. 

2.  G.  Angulo,  S.  Ahmed,  S.S.  Dey,  V.  Kaibel.  “Forbidden  vertices,”  Mathematics  of 
Operations  Research ,  vol.40,  pp. 350-360,  2015. 

3.  G.  Angulo,  S.  Ahmed,  S.S.  Dey.  “Improving  the  integer  L-shaped  method,”  under  review 
in  the  INFORMS  J.  on  Computing. 


Performance  period  4/2014-4/2015: 

1.  Transportation  problems  with  a  cardinality  constraint  on  the  number  of  demands  to  be 
satisfied:  It  is  well-known  that  the  intersection  of  the  matching  polytope  with  a  cardinality 
constraint  is  integral.  In  this  project  we  prove  a  similar  result  for  the  polytope 
corresponding  to  the  transportation  problem  with  market  choice  (TPMC)  (studied  in 
performance  period  4/2012-4/2014)  when  the  demands  are  in  a  specific  set.  This  result 
generalizes  the  result  regarding  the  matching  polytope  and  also  implies  that  some 
special  classes  of  minimum  weight  perfect  matching  problem  with  a  cardinality  constraint 
on  a  subset  of  edges  can  be  solved  in  polynomial  time. 

2.  Intersection  on  mixing  sets  with  cardinality  constraint:  Intersection  of  two  mixing  sets 
with  a  cardinality  constraint  arises  as  a  relaxation  of  deterministic  equivalent  of  chance- 
constrained  programming  problems  with  finite  discrete  distributions.  We  study  an 
extended  formulation  of  this  set  and  describe  the  convex  hull  in  some  special  cases. 

The  project  provided  partial  support  for  PhD  students  Kevin  Ryan  and  Qianyi  Wang.  One  month 
for  each  of  the  two  Pis  was  also  supported. 

The  following  publications  resulted  from  the  2013-2014  activities: 

1.  M.  Walter,  P.  Damci-Kurt,  S.S.  Dey,  S.  Kucukyavuz.  “On  a  Cardinality-Constrained 
Transportation  Problem  With  Market  Choice”  submitted  for  publication,  2015. 

2.  K.  Ryan,  S.  Ahmed,  S.S.  Dey.  “Intersection  of  mixing  sets  with  cardinality  constraint,” 
working  paper,  2015. 
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Covering  Linear  Programming  with  Violations 

Feng  Qiu,  Shabbir  Ahmed,  Santanu  S.  Dey  and  Laurence  A.  Wolsey 

May  8,  2012 


Abstract 

We  consider  a  class  of  linear  programs  involving  a  set  of  covering  constraints  of  which  at  most  k  are 
allowed  to  be  violated.  We  show  that  this  covering  linear  program  with  violation  is  strongly  WP-hard.  In 
order  to  improve  the  performance  of  mixed-integer  programming  (MIP)  based  schemes  for  these  problems, 
we  introduce  and  analyze  a  coefficient  strengthening  scheme,  adapt  and  analyze  an  existing  cutting  plane 
technique,  and  present  a  branching  technique.  Through  computational  experiments,  we  empirically  verify 
that  these  techniques  are  significantly  effective  in  improving  solution  times  over  the  CPLEX  MIP  solver. 
In  particular,  we  observe  that  the  proposed  schemes  can  cut  down  solution  times  from  as  much  as  six 
days  to  under  four  hours  in  some  instances. 


1  Introduction 

A  point  belongs  to  the  feasible  region  of  a  linear  program  (LP)  if  it  satisfies  all  the  linear  constraints  defining 
the  LP.  However,  when  certain  problems  are  being  modeled,  the  feasibility  requirement  is  soft.  That  is,  a 
point  is  considered  feasible  even  if  it  violates  no  more  than  a  specified  number  of  the  constraints  defining  the 
problem.  Such  a  linear  program  is  called  a  k-violation  linear  program  (KVLP)  [19]: 

min  cT  x 

s.t.  aj x  >  bi  i  =  l,...,m,  (1) 

at  most  k  of  the  m  constraints  can  be  violated, 
x  G  R" . 

The  feasible  region  of  a  KVLP  is  the  union  of  (™)  polyhedral  sets,  each  of  which  are  defined  by  the  intersection 
of  some  subset  of  (to  —  k )  inequalities  from  the  to  inequalities  in  (1).  In  general,  such  a  feasible  region  is 
nonconvex  and  KVLP  is  a  strongly  AfV- hard  optimization  problem  [1],  Much  of  the  existing  work  on  this 
class  of  problems  focuses  on  polynomial  time  algorithms  for  low  dimensional  problems  (i.e.  n  is  fixed  and 
small)  (cf.  [5]  for  a  survey). 

This  paper  addresses  KVLPs  where  the  linear  system  (1)  consists  of  covering  type  linear  inequalities, 
i.e.,  ai  and  bi  are  non-negative  for  all  i.  We  call  such  a  problem  a  covering-type  k-violation  linear  program 
(CKVLP).  CKVLPs,  which  are  an  important  subclass  of  KVLPs,  have  many  applications. 

As  a  concrete  example,  consider  a  probabilistically-constrained  portfolio  optimization  problem  [16]  to 
determine  a  minimum  cost  distribution  of  a  unit  investment  among  n  assets  with  uncertain  returns,  requiring 
the  overall  return  to  be  at  least  r  with  a  probability  of  1  —  e,  where  e  G  (0, 1)  is  a  prespecified  risk  level.  A 
formulation  of  this  problem  is 


min  cT  x 
s.t.  eTx  =  1 

P {aT x  >  r}  >  1  —  e  (2) 

x  G  R”, 


1 
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where  a  is  the  random  return  vector  for  n  assets  following  some  known  distribution,  P{A}  denotes  the 
probability  of  the  random  event  A ,  and  c  is  the  cost  vector.  A  common  approach  to  dealing  with  the 
probabilistic  constraint  in  (2)  is  the  sample  average  approximation  method  [12]  where  the  distribution  of  a 
is  approximated  by  an  empirical  distribution  corresponding  to  an  i.i.d  sample  of  return  vectors  {ai}7^=1.  The 
approximated  problem  then  reads  as  follows: 

min  cT  x 

s.t.  eTx  =  1 

aj x  >  r  i  =  l,...,m,  (3) 

at  most  k  of  the  m  constraints  can  be  violated, 
x  £  R™ , 

where  k  =  \rne\ .  Since  the  return  is  non-negative  and  only  nonnegative  investments  are  allowed,  (3)  is  an 
example  of  CKVLP  with  an  additional  equality  constraint.  In  Section  6,  we  discuss  a  similar  application 
of  CKVLP  in  an  optimal  vaccine  allocation  under  probabilistic  constraints  [18].  Additional  applications  of 
CKVLP  arise  in  intensity  modulated  radiation  therapy  (IMRT)  planning  [20]  and  signal  broadcasting  coverage 
design  [17]. 

A  CKVLP  can  be  modeled  as  a  mixed  integer  program  (MIP)  in  a  straight-forward  manner.  First,  note 
that  if  hi  =  0  for  any  i  £  {1, . . . ,  m},  then  the  corresponding  inequality  is  redundant  since  then  the  inequality 
is  implied  by  the  non-negativity  constraints  on  the  x  variables.  Thus,  we  assume  henceforth  that  bi  >  0  for 
all*  £  {1, . . .  ,m}  and  so  they  can  be  scaled  to  1.  Then,  an  MIP  formulation  of  CKVLP  is 

min  cT  x 

s.t.  aj x  +  Zi  >  1  i  =  l,...,m,  (4) 

m 

^2zi<k 
i— 1 

x  £  M" ,  Zi  £  {0, 1}  i  =  l,...,m, 

where  we  have  introduced  the  binary  variables  z,  taking  the  value  1  if  the  i-tli  constraint  is  violated.  For 
large  scale  CKVLPs,  the  above  MIP  formulation  performs  very  poorly.  The  goal  of  this  paper  is  to  study 
a  number  of  enhancement  schemes  to  improve  the  computational  performance  of  MIP-based  approaches  for 
solving  CKVLPs. 

We  begin  by  studying  the  theoretical  complexity  of  CKVLPs  and  illustrating  the  difficulty  of  solving 
realistic  instances  directly  by  the  CPLEX  MIP  solver  (Section  2)  as  well.  Next,  in  order  to  improve  the 
performance  of  standard  solvers  on  the  MIP  model  (4)  of  CKVLPs,  we  introduce  and  analyze  a  coefficient 
strengthening  scheme  (Section  3),  adapt  and  analyze  an  existing  cutting  plane  technique  (Section  4),  and 
present  a  branching  technique  (Section  5).  Through  computational  experiments  on  the  probabilistic  portfolio 
optimization  problem  (3)  and  an  optimal  vaccination  allocation  problem,  we  empirically  verify  that  these 
techniques  are  extremely  effective  in  improving  solution  times  (Section  6).  In  particular,  we  observe  that 
the  proposed  schemes  can  cut  down  solution  times  from  as  much  as  six  days  to  under  four  hours  in  some 
instances. 

We  close  this  section  by  pointing  out  that  all  three  enhancement  schemes  studied  here  are  applicable  when 
there  are  additional  side  constraints  in  the  MIP  (4).  This  follows  since  these  schemes  attempt  to  tighten  the 
LP  relaxation  of  (4),  which  is  a  valid  relaxation  even  when  additional  side  constraints  are  present. 

2  Difficulty  of  Solving  CKVLP 

2.1  Computational  Complexity 

General  KVLP  has  been  shown  to  be  A/""P-complete  [1] .  However,  to  the  best  of  our  knowledge,  the  complexity 
of  CKVLP,  a  sub-class  of  KVLP,  has  not  been  addressed.  In  a  recent  paper  [20],  Tungel  et  al.  showed  that 
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a  packing  version  KVLP  is  weakly  MV-haid  (the  linear  inequalities  in  KVLP  are  packing  inequalities)  by 
reduction  from  the  partition  problem.  This  result  can  be  modified  to  show  the  A/”'P-hardness  of  CKVLP.  In 
this  paper  we  provide  a  direct  proof  that  CKVLP  is  strongly  MV-L&rd. 

By  complementing  the  binary  variables  z  in  (4),  we  have  the  following  equivalent  formulation  of  CKVLP: 


mm 

s.t. 


CTX 

Ax  >  z 
eT  z  >  p 
x  €  R? 


(5) 


C  G 


*G{0,1}"*, 

})" ,  p  =  m  —  k,  e  is  the  column  vector  with  each  entry  equal  to  1,  and 


where  A  =  [oj",  G  Q” 

Q  is  the  set  of  rationals. 

To  prove  that  CKVLP  (5)  is  MV- hard,  we  first  verify  that  the  following  intermediate  decision  problem  is 
A/’7?-complete. 


Intermediate  CKVLP  Feasibility  Problem:  Given  i;  G  Q,  A  G 
a  solution  (x,  z )  G  R"  x  {0,  l}m  to  the  following  system? 


and 


cT  x  —  eT  z 


Ax 


<  p 

'■  z. 


c  G  Q”,  does  there  exist 


(6) 


Lemma  1.  The  Intermediate  CKVLP  Feasibility  Problem  (6)  is  strongly  MV -complete. 

Proof.  Since  (6)  is  a  decision  version  of  a  mixed  integer  linear  program,  it  is  in  MV.  In  order  to  show  that 
determining  the  feasibility  of  (6)  is  strongly  .V'P-complete,  we  polynomially  reduce  an  arbitrary  instance  of 
the  strongly  .V'P-complete  vertex  cover  problem  [8]  to  an  instance  of  (6).  An  instance  of  the  vertex  cover 
problem  is  defined  as  follows: 

Vertex  Cover:  Given  a  graph  G  =  (V,E)  and  q  G  N,  does  there  exist  S  C  V  such  that  (i)  |S|  <  q 
and  (ii)  S’  is  a  vertex  cover,  that  is  for  all  (i,j)  G  E,  either  i  G  S  or  j  G  S'? 

Given  an  instance  of  the  vertex  cover  problem,  we  construct  an  instance  of  (6)  by  setting  m  :=  \V\  +  E |, 

n  :=  \V\,  i)  :=  q  —  \E\,  c  :=  2e,  A  :=  j  ,  where  H  is  the  node-arc  incidence  matrix  of  G  and  I  is  a 

\V\  x  \V\  identity  matrix.  The  resulting  instance  of  (6)  is  then: 


KV 


KV 


1 

VI 

■05 

(7) 

Xi  +  Xj  >  ylj 

V  (i,j)  G  E 

(8) 

Xi  >  Zi 

V  i  G  V 

(9) 

x  G  R^1 

(10) 

2  G  {0,l}|y| 

(11) 

2/G{0,l}|s|. 

(12) 

Note  that  the  size  of  (7)-(12)  is  polynomial  in  the  encoding  length  of  G  and  q.  We  complete  the  proof  by 
showing  that  a  vertex  cover  instance  has  an  answer  yes  if  and  only  if  the  associated  system  (7)-(12)  has  a 
solution. 

(=>)  Let  S'  be  a  vertex  cover  for  G  such  that  |S|  <  q.  Then,  consider  a  solution  ( x,y,z )  G  r!^  x  {0, 1}^  x 
{0, 1}IV I  defined  as: 


Xn  =  Zn  = 


Vi, 3  = 


1  VjGS 
o  VjGV\S, 

lV(i,j)GT 
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The  solution  ( x,y,z )  satisfies  (9)-(12)  by  construction,  and  since  S  is  a  vertex  cover  it  also  satisfies  (8). 
Finally,  2  Yjjevxo  ~  J2jgv  S3  ~  =  \S\  ~  \E\  <  q  -  \E\.  Thus  the  system  (7)-(12)  has  a  solution. 


(<t=)  Now  assume  that  the  system  (7)-(12)  has  a  solution.  Note  that  an  arbitrary  feasible  solution  to  (7)- 
(12)  may  have  fractional  x  components  that  cannot  be  directly  converted  to  a  vertex  cover  for  G.  We 
show  that  there  exists  a  feasible  solution  of  (7)-(12)  with  integral  values  of  x  and  y  =  e  whenever  (7)- 
(12)  is  feasible.  Towards  this  end,  we  first  present  some  properties  of  feasible  solutions  to  (7)-(12).  Given 
(, x,y,z )  £  x  {0,1}I®I  x  {0,1}^  I,  which  satisfies  (8)-(12),  let 

f(x,y,z)  :=2j2xj  ~J2zj  ~  H 

jev  j&v  (ij)eE 

i.e.,  if  ( x,y,z )  is  feasible  for  (7)-(12),  then  f(x,y,z )  <  q—  \E\. 


Claim  a.  Given  (x1,  j/1,  z1)  satisfying  (8)-(12),  there  exists  ( x2,y2,z 2)  satisfying  (8)-(12)  such  that  y2  =  e 
i.e.  a  vector  of  ones,  and  f(x2,y2,z2)  <  fix1,  y1,  z1). 


Proof  of  Claim  a.  Suppose  there  exists  (i,j)  £  E  such  that  yj~.  =  0.  Construct  ( x3,y3,z 3)  as  follows: 


f  1 

3  =  i 

l x) 

J£V\ 

{  1 

j  =  i 

\  z) 

3&V\ 

(m) 

=  (iJ) 

Vij  (hj)  e  E\{(i,j)}. 


It  is  easy  to  see  that  (x3,  y 3,  z3)  satisfies  (8)-(12).  We  observe  that  fix1,  y1,  z1)  —  f(x3,  y3,  z3)  =  (2xf  —  zj  — 
ylj)  —  (2  x  1  —  1  —  1)  =  xj  +  (x|  —  zf )  >  0,  where  the  last  inequality  holds  due  to  the  fact  that  (x1,  y1,  z1) 
satisfies  (9).  By  repeating  the  above  construction  at  most  \E\  times  we  arrive  at  a  solution  ( x2,y2,z 2)  satis¬ 
fying  the  claim, 


We  now  restrict  our  attention  to  feasible  solutions  of  (7)-(12)  with  the  vector  y  fixed  to  e.  Next,  we  show 
that  a  feasible  solution  with  integral  x  components  exists. 


Claim  b.  Given  (x1,  e,  z1)  satisfying  (8)-(12),  there  exists  a  solution  (x2,  e,  z2)  satisfying  (8)-(12)  such  that 
x2  £  {0, 1}I'AI  and  /(x2,  e,  z2)  <  fix1,  e,  z1). 

Proof  of  Claim  b.  If  x1  £  {0,1}^  I,  then  there  is  nothing  to  verify.  If  there  exists  j  such  that  xj  >  1,  then 
we  can  set  x1  =  1.  The  resulting  solution  still  satisfies  (8)-(12),  and  the  value  of  the  function  /  reduces. 
Therefore,  the  non-trivial  case  is  when  there  exists  j  such  that  xk  £  (0, 1).  In  this  case,  we  construct  a  solution 
ix3,e,z3)  as  follows.  Examine  the  set  of  neighboring  vertices  N ij)  of  the  vertex  j.  If  xj  +  x\  >  yj.  =  1  for 
all  i  £  N(j)  then  we  may  reduce  the  value  of  xj  by  a  sufficiently  small  positive  value  so  that  (x^e,  z1)  still 
satisfies  (8)-(12)  and  the  value  of  fixl,e,  z1)  reduces.  Therefore,  we  may  assume  that  there  exists  a  vertex 
i  £  N(j)  such  that  xj  +  xj  =  1.  Without  loss  of  generality,  we  may  assume  that  1  >  xj  >  \  (otherwise  we 
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can  swap  i  and  j),  which  also  implies  that  z~.  =  0.  Next,  we  construct  (a;3,  e,  z3)  as  follows: 

x3-{  1  j=j  ~ 

3  I  x)  j&v\{j }’ 

z3-f  1  3=3  ~ 

3  1  z)  jeV\{j}  ■ 

It  is  easy  to  see  that  (x3,e,z3)  with  x3  G  {0,1}  as  constructed  above  satisfies  (8)-(12).  Furthermore 
/(V,  e,  2:1)  —  f(x3,e,z3)  =  2x1  —  (2  —  1)  =  2x1  —  1  >  0.  By  repeating  the  above  construction  at  most 
\V\  times,  we  obtain  the  required  (x2,e,z2)  satisfying  the  claim.  <C> 

From  the  claims  a  and  b,  it  is  clear  that  there  exists  a  feasible  solution  of  the  form  (x,  y ,  z)  with  (i)  y  =  e 
and  (ii)  x  G  {0, 1}0  .  If  ajy  =  1  and  Zj  =  0  for  some  j,  then  we  can  set  z3  =  1,  and  the  resulting  solution 
is  still  feasible  for  (7)-(12).  Therefore,  we  may  assume  that  the  feasible  solution  also  satisfies  Xj  =  Zj  for  all 
j  G  V.  We  select  any  such  feasible  solution  and  let  S  =  {j  :  Xj  =  1}.  Clearly,  S  is  a  vertex  cover  for  G  since 
y  =  e.  Notice  that  f(x,  y,  z )  =  2|S|  —  |5|  —  \E\  <  q  —  \E\  or  equivalently  |5|  <  q.  □ 

Theorem  1.  CKVLP  is  strongly  MV -hard. 

Proof.  To  verify  that  (5)  is  MV- hard,  we  show  that  if  there  exists  a  polynomial  time  algorithm  for  solving 
(5),  then  there  is  a  polynomial  time  algorithm  for  deciding  the  feasibility  of  (6).  This  completes  the  proof, 
since  by  Lemma  1,  we  have  that  deciding  the  feasibility  of  (6)  is  ./V'P-complete. 

Let  v(p)  denote  the  optimal  value  of  (5)  as  a  function  of  p  G  {0, . . . ,  m}.  Consider  the  following  algorithm 
for  deciding  the  feasibility  of  (6): 

1.  Given  A  G  Z™xn,  c  G  Z",  and  ry  G  Z,  compute  v{jp)  for  all  p  G  {0, . . . ,  m},  using  the  polynomial-time 
algorithm  for  solving  (5). 

2.  Compute  rj*  :=  min  {v(p)  —p}.  If  y*  <  rj,  return  “yes,”  (i.e.  (6)  is  feasible);  otherwise  return  “no.” 

0<p<m 

Notice  that  the  above  algorithm  is  a  polynomial  time  algorithm  in  the  size  of  the  encoding  of  (6).  It  remains 
to  verify  the  validity  of  the  above  algorithm. 

Suppose  rf  <  y  and  p*  G  argmin{v(p)  —  p}.  Consider  an  optimal  solution  (x*,z*)  to  (5)  corresponding 
to  p  =  p* .  Since  y  >  y*  =  v(p*)  —  p*  >  v(p*)  —  eT z*  =  cTx*  —  eTz*,  the  instance  of  (6)  is  feasible. 

Suppose  y*  >  y.  Assume  by  contradiction  that  the  instance  of  (6)  is  feasible  and  let  (x*,z*)  be  a  feasible 
point.  Let  p*  =  YHj=iz*j-  Then,  observe  that  {x*,z*)  is  feasible  to  (5)  corresponding  to  p  =  p* .  Thus, 
y*  <  v(p*)  —  p*  <  cTx*  —  p*  <  y,  a  contradiction.  □ 

2.2  Performance  of  a  standard  MIP  solver  on  CKVLP  instances 

Given  the  significant  advancements  made  in  MIP  solution  technology,  many  instances  of  VP-hard  problems 
are  not  necessarily  difficult  to  solve  in  practice.  To  assess  the  practical  computational  difficulty  of  CKVLP, 
we  next  report  on  the  performance  of  CPLEX,  a  state-of-the-art  MIP  solver,  on  randomly  generated  instances 
of  the  MIP  (4). 

We  consider  instances  with  n  =  20,  m  =  200  and  k  G  {15,  20}.  The  data  is  generated  as  follows: 

1.  “Dense  Data”:  Each  left-hand-side  coefficient  a,y  is  generated  uniformly  between  0.8  and  1.5,  and  then 
the  coefficients  are  divided  by  1.1.  The  cost  vector  is  a  vector  of  ones. 

2.  “Sparse  Data”:  This  uses  the  same  input  data  as  in  “Dense  Data”,  except  that  half  of  the  left-hand-side 
coefficients  are  randomly  set  to  zero. 
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3.  “Random  Objective”:  These  instances  have  the  same  constraint  coefficients  as  in  “Dense  Data”,  but 
with  random  integer  cost  coefficients  between  1  and  100. 

For  each  of  the  six  combinations  of  two  values  of  k  and  three  data  classes,  we  considered  10  instances  with  a 
total  for  60  instances.  The  computations  are  run  on  Intel  Xeon  2.27  GHz  dual  core  Linux  server  installed  with 
4  Gb  RAM.  The  model  is  implemented  with  the  callable  libraries  and  solved  by  the  MIP  solver  in  CPLEX 
12.1  with  default  settings. 

The  average  results  over  ten  instances  in  each  size-data  combination  are  presented  in  Table  1.  The  ‘Gap’ 
column  in  the  table  reports  the  root  node  LP  relaxation  gap  closed  by  CPLEX  cuts.  That  is,  the  value 
^  x  100,  where  ZLP+Cuts .  zLP ,  and  z*  are  the  objective  function  values  of  the  LP  relaxation 
with  CPLEX  cuts  at  the  root  node,  of  just  the  LP  relaxation,  and  of  the  MIP,  respectively.  The  ‘Nodes’  and 
the  ‘Time’  columns  report  the  number  of  branch-and-bound  tree  nodes  generated  and  the  time  in  seconds 
needed  to  solve  the  instances  to  optimality,  respectively. 


Dense  Data 

Sparse  Data 

Random  Objective 

k 

Gap 

Nodes 

Time 

Gap 

Nodes 

Time 

Gap 

Nodes 

Time 

15 

2% 

3,537,864 

2,454 

7% 

158,039 

83 

17% 

1,777 

1 

20 

2% 

43,296,679 

25,948 

6% 

1,769,574 

917 

21% 

6,227 

2 

Table  1:  Performance  of  CPLEX  on  CKVLP 

Following  are  some  observations  based  on  the  above  computations. 

1.  The  effect  of  k:  Setting  k  to  a  larger  value  results  in  a  substantial  increase  in  time  and  memory 
consumption  (measured  in  the  number  of  nodes  in  the  branch-and-bound  tree),  as  seen  by  a  ten¬ 
fold  increase  for  the  first  two  types  of  instances.  This  phenomenon  can  perhaps  be  explained  by  the 
combinatorial  nature  of  CKVLP,  which  is  to  choose  the  linear  program  with  the  best  objective  value 
among  (™)  linear  programs.  When  k  increases  to  the  number  of  possible  linear  programs  increases 
rapidly. 

2.  The  effect  of  sparsity:  The  coefficient  matrix  density,  measured  by  the  number  of  non-zeros,  can  make 
instances  significantly  harder  to  solve,  as  seen  by  a  20-time  increase  in  nodes  and  30-time  increase 
in  time  when  the  density  increases  from  50%  to  100%.  The  dense  coefficients  not  only  make  the  LP 
relaxation  hard  to  solve,  but  also  make  it  hard  for  CPLEX  to  find  effective  cuts,  e.g.,  CPLEX  default 
cuts  close  only  2%  of  the  LP  relaxation  gap  in  the  “Dense  Data”  instances,  whereas  6-7%  of  the  gap  is 
closed  in  the  “Sparse  Data”  instances. 

3.  The  effect  of  objective  function:  The  objective  function  coefficients  play  a  crucial  role  in  determining 
the  computational  difficulty,  as  demonstrated  by  the  contrast  between  “Dense  Data”  and  “Random 
Objective” .  The  instances  with  random  objective  coefficients  can  be  solved  in  seconds;  however,  the 
instances  with  the  same  constraints  but  uniform  objective  coefficients  in  “Dense  Data”  take  hours  to 
solve.  When  the  cost  coefficients  and  the  constraint  coefficients  are  set  up  in  a  way  so  that  the  objective 
values  of  linear  programs  formed  by  different  choices  of  linear  constraints  are  close,  the  branch-and- 
bound  procedure  generates  a  great  number  of  nodes,  of  which  the  LPs  are  similar  in  terms  of  bounds, 
and  the  MIP  solver  spends  an  enormous  amount  of  time  on  proving  optimality. 

In  the  rest  of  the  paper,  we  focus  on  variants  of  the  most  difficult  class  of  the  above  instances,  that  is, 
instances  that  are  very  similar  in  type  to  “Dense  Data,”  and  attempt  to  tighten  the  root  node  lower  bound 
and  reduce  the  size  of  the  search  tree. 
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3  Iterative  Coefficient  Strengthening 

In  this  section,  we  propose  and  analyze  a  scheme  for  strengthening  the  coefficients  of  the  binary  variables  in 
the  MIP  formulation  (4)  of  CKVLP.  Let  X  denote  the  set  of  feasible  x  solutions  of  (4),  i.e. 

m 

X  :=  {x  £  R"  :  3z  £  {0, l}m  s.t.  aj x  +  Zi  >  1  V  i  =  1, to  and  ^  Zi  <  k}.  (13) 

i=i 

Definition  1.  A  vector  l  £  Rm  is  called  a  valid  bound  vector  if  Ii  <  minja^ x  :  x  £  X}  for  all  i  =  1, . . . ,  to. 
Given  a  valid  bound  vector  t,  let 

m 

X(£)  :=  {x  £  R"  :  3 z  £  [0,  l]m  s.t.  aj x  +  (1  —  £f)Zi  >  1  V  i  =  1, ...,  to  and  ^  3%  <  k}. 

i=  1 

Proposition  2.  (i)  If  £  is  a  valid  bound  vector  then  X(£)  D  X.  (ii)  The  bound  vector  £  =  0  is  valid,  (in) 
For  valid  bounds  £l  and  £2 ,  if  £2  >  £x  then  X(£x)  D  X(£2). 

Proof,  (i)  If  x  £  X  then  there  exists  z  £  {0,  l}m  such  that  aj  x  >  max{l  —  for  all*  =  1, . . . ,  to  and 

zi  —  k-  Since  max{l  —  Zi,£i}  =  1  —  (1  —  £f)Zi  when  Zj  £  {0, 1},  it  follows  that  aj x  +  (1  —  £f)Zi  >  1  and 
x  £  X(£).  (ii)  Since  aTx  >  0  for  all  x  £  R",  we  obtain  that  £  =  0  is  a  valid  bound  vector,  (iii)  If  £  £  X(£2) 
then  there  exists  z  £  [0,  l]m  such  that  aj  x  >  1  —  (1  —  £2)zi  for  all*  =  1, . . . ,  to  and  Yl'iL  \  zi  ^  Since  z,  >  0 
this  implies  that  aj x  >  1  —  (1  —  £\ )zt,  hence  x  £  X(£l).  □ 

Note  that  V(0)  is  the  projection,  on  to  the  x  variables,  of  the  LP  relaxation  of  the  MIP  formulation  (4). 
Proposition  2  suggests  that  we  can  strengthen  this  LP  relaxation  by  iteratively  tightening  the  bound  vector 
£  and  hence  the  coefficients  of  the  binary  variables  in  (4),  starting  from  t  =  0.  Algorithm  1  describes  such 
a  coefficient  strengthening  procedure.  Note  that  procedure  requires  solving  to  feasible  linear  programs  with 
bounded  objectives  in  each  iteration. 


Algorithm  1  Iterative  Coefficient  Strengthening 

Input  :  A  threshold  parameter  e  >  0  and  the  data  (to,  n,  k,  al3 )  describing  X 
Output  :  A  valid  bound  vector  £  £  R™ 

A  <-  2e,  t  <-  1,  t  <-  0 
while  A  >  e  do 
for  i  =  1, ...,  to  do 

f?-+1  =  min  {aj  x  :  x  £  A'(^t)} 

end  for 

A  «—  \\£t+1  -  PIL 

t  i —  t  -(-  1 

end  while 

£<-£* 


Proposition  3.  Let  {£*}  be  the  sequence  of  bound  vectors  produced  in  Algorithm  1.  We  have  (i)  £t+1  >  £l 
and  (ii)  £l  is  a  valid  bound  vector  for  all  t.  Accordingly ,  Algorithm  1  terminates  finitely  returning  a  valid 
bound  vector  £. 

Proof.  We  proceed  by  induction  on  t.  For  the  base  case  t  =  1  we  have  £x  =  0,  then  (ii)  holds  from  part  (ii) 
of  Proposition  2.  Moreover  £2  =  minja^a:  :  x  £  A'(0)}  >  0  for  all  i,  hence  (i)  holds.  Suppose  now  that  (i) 
and  (ii)  hold  for  some  t  >  1.  By  definition  =  min{aira-’  :  x  £  A(f*)}  for  all  i  =  1, . . . ,  to.  Thus,  for 
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each  i  =  1 ,m,  £\+1  <  aj x  for  all  x  G  X(£f)  and  lienee  for  all  x  €  X  since  X  C  X {£))  from  the  validity 
of  £f .  Thus  £t+1  is  a  valid  bound  vector  and  (ii)  holds  for  all  t.  By  our  induction  hypothesis  £t+1  >  £f  thus 
X(£t+1)  C  X(£*)  by  part  (iii)  of  Proposition  2.  Thus  £-+2  =  minja^a;  :  x  G  X(£t+1)}  >  minja^a;  :  x  G 
X(£f)}  =  £-+1  for  all  i  =  1 ,m,  and  so  (i)  holds  for  all  t.  Finally  note  that,  for  any  t,  X (£*)  D  X  from 
part  (i)  of  Proposition  2,  thus  i\  —  minja^a;  :  x  G  X{£*)}  <  minja^a;  :  x  G  X}  =:  £*,  where  £*  is  a 
well  defined  finite  value,  for  all  i  =  1, ...  ,m.  Thus,  for  each  i  =  1, . . . , m,  {£*}  is  a  bounded  nondecreasing 
sequence,  and  hence  convergent.  It  follows  that  for  any  £  >  0  there  exists  a  sufficiently  large  value  of  t  such 
that  ||  £t+1  —  £*11^  <  £  ensuring  finite  termination  of  the  algorithm.  □ 

Next  we  analyze  the  strength  of  the  LP  relaxation  of  (4)  using  tightened  coefficients  derived  using  Algo¬ 
rithm  1.  Given  a  cost  vector  c,  let 

v*  =  min{cTa:  :  x  G  X}  and  zL(£)  =  min{cTa:  :  x  G  X{£)},  (14) 

be  the  optimal  value  of  the  MIP  (4)  and  the  optimal  value  of  the  LP  relaxation  corresponding  to  bound 
vector  £,  respectively.  Note  that  these  values  are  finite  as  long  as  c  >  0.  Recall  that  vL(0)  is  the  natural 
LP  relaxation  bound  for  (4),  and  the  coefficient  tightening  scheme  in  Algorithm  1  is  aimed  to  improve  this 
bound.  In  the  following  we  analyze  this  improvement  as  a  function  of  the  instance  data.  For  simplicity  of 
the  analysis  we  assume  that  Cj  >  0  and  a-ij  >  0  for  all  i  and  j.  Let 


p  =  min  min 


_ aij 

(1  /m)£?=1 


(15) 


Note  that  p  is  a  measure  of  the  variability  of  the  constraint  coefficient  data  and  p  G  (0,1].  Let  {£*}  be 
the  sequence  of  bound  vectors  produced  by  the  scheme  in  Algorithm  1  with  a  threshold  of  £  =  0.  From 
Proposition  3  we  know  that  this  sequence  is  convergent.  Let 


t  =  lim  £\ 

t—too 


(16) 


Recall  that  m  is  the  number  of  constraints  in  (4)  and  k  is  maximum  number  of  constraints  allowed  to  be 
violated. 

Lemma  2.  Assuming  atj  >  0  for  all  i  =  1, . . . ,  m  and  j  =  1, . . . ,  n, 


e*  ^  m  -  k 

£i  >  - tP  Vi=l,...,m, 

m  —  pk 

where  p  and  £*  are  as  defined  in  (15)  and  (16),  respectively. 

Proof.  Let  {w*}  be  a  sequence  of  m  dimensional  vectors  defined  by  the  following  recursion: 

u\  —  0  and  it-+1  =  p(  1  —  (1  —  u\)k/m )  Vi=l,...,m,  V  t  >  1.  (17) 


First,  we  claim  that 

£*  >  ul  >  0  V  t  >  1.  (18) 

We  prove  this  claim  by  induction  on  t.  Note  that  (18)  holds  for  t  =  1  since  £\  =  uj  =  0  for  all  i.  =  1, . . . ,  m. 
Suppose  now  that  (18)  holds  for  some  t  >  1.  Since  u\  >  0  and  0  <  k/m  <  1  we  have  that  (1  —  (1  —  u\)k/m)  = 
{l  —  k/m)+u\k/m  >  0,  and  hence  u*+1  >  0.  Let  p,j  =  aij/m  f°r  J  =  1, . . .  ,n  and  p  be  the  corresponding 
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n-dinrensional  vector.  For  any  i  =  1, . . . ,  m, 

/tf+1 

H 

min{o7  x 

:  xeX^)} 

(19) 

> 

min{o7  x 

:  x  G  X(u*)} 

(20) 

= 

min{o7  x 

:  aj,x  +  (1  —  u\,)zi'  >  1  V  i'  =  1, . . . 

,  m,  ^  zv  <  k,  x  G  R" ,  2  G  [0, 1]™} 

i'  =  l 

(21) 

> 

minja^  x 

:  pTx  >  1  —  (1  —  w*)fc/m, x  G  R"} 

(22) 

= 

(1  -  (1  - 

ul)k/m)(m in  {aij/pj}) 

(23) 

> 

p(l-(l- 

-  u\)k/m) 

(24) 

= 

u*+1 

(25) 

where  (20)  follows  from  the  induction  hypothesis  £*  >  u*  since  X(P)  C  X(w*)  by  Proposition  2(iii);  (21) 
follows  from  the  definition  of  X(u*);  (22)  follows  by  aggregating  the  m  rows  of  the  linear  program  defined  in 
(21),  noting  that  u\,  =  u\  for  all  i  and  i' ,  and  eliminating  the  2  variables;  since  (1  —  (1  —  vffjk/m)  >  0,  (23) 
follows  from  the  optimal  solution  of  the  single  constrained  linear  program  defined  in  (22);  (24)  follows  from 
the  definition  of  p;  and  (25)  follows  from  the  definition  of  u\+l .  Thus  (18)  holds. 


Next  we  claim  that,  for  all  *  =  1, . . . ,  m,  {it*}  is  convergent  and 


lim  it*  = 

t—¥  OO 


m  —  k 
to  —  pk^ 


(26) 


Consider  any  i  G  {1, . . . ,  m}.  We  first  verify  that  it*  <  ™_}fkp  for  all  t.  We  proceed  by  induction  on  t.  By  def¬ 
inition  u\  =  0  <  ™IpkP-  By  induction  hypothesis,  we  have  that  it*  <  ™IkkP-  Now  tt*+1  =  p  —  p ^  +  p^w*  < 
p  —  p ^  +  p}|  f  ™Ipk p)  =  m-pk P-  Now  we  vei'ify  that  the  sequence  {it*}  is  non-decreasing.  Observe  that 


u*  -  it*+1  - 


=  <-(p-P^+  P^t)  =  ^  (1  -P^)~P  +  P±<  (%E&p)  (1  ~Pi)-P  +  P^  =  0.  Finally 


suppose  by  contradiction  that  the  sequence  {it*}  converges  to  a  value  ™_kkp  _  where  S  >  0.  Therefore, 
there  exists  a  t  such  that  m~k  p  —  5  >  it*  >  m~k,p—  S  —  e,  in  which  e  =  8  (l  —  p—) .  Since  p—  <  1,  we  have 
(1  -  p±)  <  1.  Hence,  we  obtain  it*  -  tt*+1  <  {j^p  -  s)  (l  -  p£)  -  p  +  p^  =  ~{S)  (l  -  p±)  =  -e.  Thus, 
w*+1  >  it*  +  e  >  ™Zpk  P  ~  8  which  is  a  contradiction.  Thus  (26)  holds. 


It  then  follows  from  (18)  and  (26)  that 


?7t  —  k 

> - - p  V  i  =  l,...,  to. 

m  —  pk 


Theorem  4.  Assuming  Cj  >  0  and  aij  >  0  for  all  i  =  1, . . . ,  m  and  j  =  1, . . . ,  m, 

v*  —  vL(t*)  m(  1  —  p) 
v*  ~  m  —  pk 


□ 


(27) 
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Proof.  Note  that 


,.L(  n*\ 


where 


min  <  cTx  :  aj  x 


+  (1  —  t*)zi  >  1  V  i  =  1, . . .  ,  to,  ^  Zi  <  k,  x  £  R” ,  2  G  [0,  l]r 


(28) 


i= 1 


>  min|cTx:  pT  x  +  (1  — 

Cj  m  —  k 
li j  m  —  pk 


to  —  k  . 

,  P)Zi  >  1  V  1  =  1, . . 
to  —  pk 

m 

.  ,  TO,  ^^2j  <  k,  X  G  R”, 
i—1 

2G  [0,l]mJ(29) 

to  —  k  .  k  „  , , 

p)  >lVt=l,. 

m  —  pk  m 

.  .  ,  TO,  X  G  R"  | 

(30) 

j  G  ar  grain  l  —  :  j  =  \  . 

I  kj 


(31) 

(32) 


In  the  above,  (29)  follows  from  Lemma  2;  (30)  follows  from  aggregating  the  rows  of  the  LP  defined  in  (29) 
and  eliminating  the  z  variables;  and  (31)  follows  from  solving  the  single  constrained  LP  defined  in  (30). 


Note  that 


v*  =  min 


m 

cTx  :  ajx  +  Zi>  1  Vi  G  {1, . . . ,  to},  Zi  <  k,  x  G  R” , 

»= l 


2  G  {0, 1} 


Next  we  obtain  an  upper  bound  on  v* .  For  j  defined  in  (32): 

1.  Sort  aP s  from  smallest  to  largest. 

2.  Let  aj-  be  the  ( k  +  l)st  smallest  number. 

3.  Let  vH  =  This  corresponds  to  the  objective  function  value  of  the  feasible  solution  Xj  =  0  for  j  ^  j 

and  x-.  —  — .  Thus  v*  <  vH . 

*  °« 

Now  observe  that 


c-j  to  —  k 
/xj  m  -  pk 


<  vL  <  V*  < 


Therefore  using  the  definition  of  p  we  obtain  that, 

v*  —  vL  zH  —  vL  to(1  —  p) 

V*  ZH  TO  —  pk 


(33) 


(34) 

□ 


4  Mixing  Set  Inequalities 

In  this  section,  we  study  valid  inequalities  derived  from  a  mixing  set  relaxation  of  CKVLP.  A  mixing  set  is 
defined  as  follows: 

p  =  {(y.-)  e  R+  X  {0,1}”  :  y  +  hiZi  >hui=  l,...,n},  (35) 
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where  hi  >  /12  >  •  •  ■  >  hn.  The  mixing  set  was  introduced  by  Giinliik  and  Pochet  [10],  and  its  variants  in 
different  contexts  have  also  been  studied  in  [6,  15,  7,  21,  9,  11].  The  following  inequalities,  known  as  mixing 
(set)  inequalities ,  are  valid  for  P: 

1 

V +  ^2(htj  -  htj+1)ztj  >  htl  VT  =  {*!,...,*/}  C  N,  (36) 

l=i 

where  htl  >  ht2  >  •  •  •  >  htl  and  ht.l+1  :=  0.  Furthermore,  these  inequalities  can  be  separated  in  polynomial 
time,  are  facet-defining  for  P  when  t\  =  1,  and  are  sufficient  to  describe  the  convex  hull  of  P  [2,  10]. 

Recently,  the  mixing  set  inequalities  have  been  applied  to  solve  the  MIP  formulation  of  chance-constrained 
problems,  which  has  a  fc-violat ion- type  substructure,  i.e. ,  a  feasible  solution  must  satisfy  the  constraints 
corresponding  to  at  least  k  out  of  m  scenarios  [13,  14].  CKVLPs  can  be  viewed  as  a  special  case  of  this 
substructure  in  which  each  scenario  consists  of  only  one  covering  linear  constraint.  We  next  describe  and 
analyze  the  mixing  set  inequalities  for  CKVLPs. 

Let  the  set  of  ( x ,  z)-solutions  to  the  MIP  (4)  be  denoted  by  Xmip,  and  recall  from  (13)  that  the  set  of  re¬ 
solutions  to  (4)  is  denoted  by  X.  Note  that  X  is  the  projection  of  Vmip  into  x-space,  i.e.,  X  =  Projx(AMip). 
Following  [14],  we  can  obtain  a  mixing  set  relaxation  of  Xmip  as  follows.  Given  a  vector  a  £  R™,  calculate 
0? ,  i  £  {1, . . . ,  m}  as  below: 

:=  min  aTx 

s.t.  ajx  >  1,  x  £  R", 

where  a*  is  the  coefficient  vector  for  the  i-th  constraint  in  the  MIP  (4).  Assume  without  loss  of  generality 
that  0i  >  /3%  >  ...  >  0(f,  and  consider  the  following  set 

F(a):={(v)er+x{0,lf  :  ar x  +  (0?  -  0%+i) Zi  >  0? ,  i  =  l,...,k}.  (37) 

Proposition  5.  For  any  a  £  R™ ,  Amip  C  Y(a)  and  X  C  Projx.(Y (a)) 

Proof.  Let  (x,z)  £  A'mip-  Then  the  non- negativity  constraints  and  integrality  constraints  in  Y (a)  are  satisfied 
by  (x,z).  Without  loss  of  generality,  we  may  assume  that  the  indexes  1,  ...,k  in  Y(a)  are  the  first  k  indexes 
in  Xmip-  It  remains  to  verify  that  (x,  z)  satisfies  the  constraints  aTx  +  (0f  —  0 %+1)zi  >  0f  for  all  i  =  1, ...,  k. 

(i)  For  i  such  that  Zi= 1:  We  require  to  verify  that  aTx  >  /3j?+1-  Since  (x,z)  £  A'mip,  there  exists  some 

u  £  {1, . . . ,  k  +  1}  such  that  a^x  >  1.  Moreover  as  /3“  =  min{aTx  :  x  £  R"  ,  a^x  >  1},  we  obtain  that 

aTx  >  0“  >  0%+1,  where  the  last  inequality  is  due  to  the  fact  that  u  <  k  +  1. 

(ii)  For  i  such  that  Zi=  0:  We  require  to  verify  that  aTx  >  0f.  Since  (x,  z)  £  Amip,  we  obtain  that  aj x  >  1. 

Moreover  as  0f  =  min{o:Tx  :  x  £  R™,  aj x  >  1},  we  have  that  aTx  >  0f. 

Therefore,  (x,z)  £  Y(a)  and  Amip  C  Y(a).  The  result  X  C  Projx(V(a))  follows  from  the  fact  that 
A  =  Projx(AMip)-  □ 

The  set  Y(a)  is  a  valid  relaxation  of  Xmip  and  it  is  in  the  form  of  a  mixing  set.  This  can  be  noted  by 
considering  y  :=  (aT x  —  /3£+1)  as  a  nonnegative  continuous  variable  to  obtain  the  mixing  system 

y  +  ( 0 “  -  0k+i >  0?  -  0k+i  v  *  =  1, ... ,  k. 

Thus,  we  have  the  complete  description  of  conv(Y(a))  using  the  inequalities  (36),  which  are  also  valid  for 
Amip,  i.e.,  conv(XMip)  C  conv(Y(a)).  Let  us  call  HaeR"  conv(Y(a))  the  mixing  closure.  Clearly,  the  mixing 

closure  is  a  valid  relaxation  of  conv(XMip).  Let  pMIX  be  the  optimal  objective  value  of  optimizing  over  the 
mixing  closure,  and  v*  be  the  optimal  objective  value  of  the  MIP  (4).  Then,  the  best  root  node  gap  that  can 
be  potentially  achieved  by  the  mixing  inequality  procedure  is  bounded  by  (v*  —  vMlx)/v*.  To  study  this  gap 
quantitatively,  e.g.,  deriving  a  bound  for  (v*  —  vMlx)/v*,  we  analyze  the  projection  of  the  mixing  closure  on 
the  x-space,  i.e.,  Pi'oja,(nQ,eR"  conv(Y(a)))  in  the  following  subsections. 
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4.1  The  mixing  closure 

Note  that 

conv(X)  =  Projx(conv(XMip)))  C  Projx(  P  conv(F(a)))  (38) 

C  Pi  P r° j x ( conv ( Y ( a ) ) )  =  p  conv ( P ro j X(Y (a))). 

a;£R™ 

Thus,  minimizing  over  DaeiR"  conv(Projx(y(a)))  yields  a  lower  bound  for  r;MIX. 

Proposition  6.  Projx(F(a))  =  {x  G  R"  :  aTx  >  /3%+1}. 

Proof.  C:  Let  x  G  Projx(F(a)),  then  there  exists  z  G  {0,  l}fc  such  that  aTir  +  (/?“  —  /3%+1)zi  >  /3f,  i  =  1, ....,  k. 
Thus  aT x  >  /?f  (1  -  Zi)  +  Pk+1Zi  >  /3£+1  since  /3f  >  /J£+1  and  zt  G  [0, 1], 

D:  Let  x  G  {x  G  R™  :  aTx  >  /3£+1},  set  Zi  =  l,i  =  then  (x,z)  G  Y(a)  and  x  G  Projx(F(cr)).  □ 

Since  Projx(y(a))  is  a  half  space  in  the  non-negative  orthant  and  hence  convex,  the  convex  hull  operator  in 
flaeRj  conv(Projx(F(a)))  is  unnecessary. 

Proposition  7. 

P  conv(Projx(y(a)))  =  P  Projx(T(a))  =  P  {x  G  R+  :  aTx  >  ££+1}. 

agR™  agR£  agR^ 

Proposition  7  and  (38)  indicate  that  the  projection  of  the  mixing  closure  onto  the  at-space  is  contained  in  the 
closure  constituted  by  infinitely  many  half  spaces.  To  study  this  closure,  we  give  a  formal  definition  as  below: 

Definition  2  (Basic  Mixing  Closure).  The  Basic  Mixing  Closure  is  defined  as 

M  :=  P  {a:  G  R"  :  aTa;  >  /?“},  (39) 

agR™ 

where  /3a  :=  0%+1 . 

We  call  aT x  >  (3a  a  basic  mixing  inequality  corresponding  to  a.  In  order  to  understand  the  basic  mixing 
closure,  we  describe  another  class  of  inequalities. 

Definition  3  (Simple  Disjunctive  Cuts  and  Closure). 

1.  Select  a  subset  S  of  k  +  1  constraints.  Since  at  least  one  of  these  constraints  must  be  satisfied,  we  obtain 
the  simple  disjunction: 

(a^a;  >  1, ar  G  R")  V  (aj2x  >l,xG  R")  V  •  •  •  V  {ajk+1  x  >  l,x  G  R"),  (40) 

where  S  =  {h, . . .  4+i}- 

2.  Define  as  &  R"  as 

(fls)j  =  ma xieS{aij}  Vj  =  1,  ...,n. 

The  convex  hull  of  (40)  is 

(as)Ta;  >  1,  x  G  R" , 

and  we  call  (as)Ta;  >  1  a  simple  disjunctive  cut. 
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We  define  the  simple  disjunctive  closure  as 


V  :=  P|  {x  £  R"  :  ( as)Tx  >  1}. 

SC{l,...,m},|S|=fe+l 


(41) 


Proposition  8.  V  =  M. 

Proof.  T>  C  M:  For  any  given  a ,  without  loss  of  generality,  let  pi  >  . . .  Pk  >  Pk+i  >  >  j3m.  Then 

Pa  =  Pk+i-  Since  aT x  >  Pi  is  a  valid  inequality  for  the  set  {aj x  >  l,x  £  R"  }  Vi  =  1, k  +  1,  aT x  >  pa 
is  a  valid  inequality  for  the  convex  hull  of  the  set 

(a^z  >l,ig  R")  V  ( ajx  >  l,x  £  R")  V  ■  ■  •  V  ( aj+1x  >  l,x  £  R"), 

or  equivalently  aTx  >  pa  is  dominated  by  the  inequality  (os)Tcc  >  1. 

M  C  V:  Let  S  C  {1, ... ,  to}  such  that  151  =  k  +  1.  We  set  a  =  as-  Then  for  any  i  £  {1, . . . ,  to}, 

Pi  =  min  (as)T:c 

s.t.  (a,:)Ti>l,a:£R}. 

Since  <  {as)j,  we  obtain  that  Pi  =  min1<j<n-°pL  >  1.  Therefore,  Pas  >  1.  Hence,  the  basic  mixing 
inequality  is 

(as)Tx  >  P(as) 


which  dominates  the  inequality  ( as)Tx  >1.  D 

Because  to  and  k  are  finite  numbers,  the  number  of  simple  disjunctive  cuts  is  also  finite,  the  following  result 
is  immediate: 

Corollary  9.  A4  is  polyhedral. 

4.2  Bound  Quality 

Using  the  equivalence  of  T>  and  A4 ,  and  the  fact  that  T>  has  an  explicit  form  and  simple  structure,  we  derive 
a  lower  bound  for  ijmix  by  studying  V.  We  then  provide  a  bound  on  the  best  possible  gap  achievable  by  the 
addition  of  all  possible  mixing  inequalities,  i.e. ,  (v*  —  vMlx)/v*. 

Proposition  10.  Suppose  c  >  0  and  aij  >  0  for  all  i,j.  Let  a  =  min,?  {atj }  and  a  =  ma Xjj{ajj}.  Let  v*  be 
the  optimal  objective  value  over  X  and  vM  be  the  optimal  value  over  the  basic  mixing  closure,  then 

v*  —  riMIX  v*  —  vM  a  —  a 

0  <  - <  - <  - =. 

v*  v*  ~  a 

Proof.  Let  c  =  mingle,}.  Note  that  v*  <  min{cTa;  :  aj  x  >  IV  *  =  1, . . . ,  to,  x  £  R™}  <  min{cTa:  :  ( eTx )  > 
1/a,  x  £  R"}  =  c/a.  By  the  equivalence  of  V  and  M,  we  obtain  that  vM  —  min{cTa;  :  (as)T x  >  1  V5  C 
{1, ...,  to},  |5|  =  k+  l,x  £  R"  }  >  min{cTa;  :  a(eTa;)  >  1,  x  £  R"  }  =  c/a.  Thus,  (v*  —  vM)/v*  =  1  —  vM /v*  < 
1  —  (c/a)/(c/a)  =  (a  —  a) /a.  □ 

The  above  result  implies  that  the  relaxations  D  and  equivalently  M  can  be  tight  when  the  variation  of 
the  constraint  coefficients  is  small.  However,  the  separation  of  the  most  violated  simple  disjunctive  cut  from 
V  is  WP-complete.  Consider  an  arbitrary  x*  £  R"  that  we  want  to  separate.  Let  M  :=  {i  £  {1, .. .  ,m}  : 
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aj x*  <  1}.  Clearly,  \M\  >  k,  because  otherwise,  x*  belongs  to  the  feasible  region  of  the  fc-violation  problem 
A  and  therefore  belongs  to  V.  When  \M\  >  k  +  1,  we  solve  the  following  separation  problem: 

n 

r]  =  min  ^jXj  ~  1 
j= 1 

s.t.  7 tj  +  ciij'Wi  >  a,ij  j  =  1,  ...,n;  \/i  G  M 

^2  Wi  =  \M\  -  (k  + x) 

i€M 

7Tj  >  0  Vj  G  {1,.  ..,nj 
Wi  G  {0, 1}  V*  G  M, 

where  ttj  is  the  cut  coefficient  for  variable  Xj  and  Wi  is  a  binary  variable  taking  value  0  whenever  the  *-th  row 
is  considered  in  the  disjunction  (40).  The  inequality  Yl'j-i  7TMxi  —  ^  seParates  x*  from  V  if  and  only  if  77  <  0. 
This  separation  problem  is  NP-hard  [13].  Notice  that  although  the  mixing  closure  is  contained  in  V  and  sepa¬ 
ration  over  V  is  A/'T’-complete,  we  do  not  know  the  complexity  of  the  separation  over  Projx(f)Q  conv(T (a))). 


5  Branching  Scheme 

As  demonstrated  in  Table  1,  the  branch  and  bound  search  tree  could  be  enormously  large  even  for  a  small¬ 
sized  instance  of  the  MIP  (4).  Part  of  the  reason  for  the  excessive  number  of  nodes  is  the  overlap  in  the 
search  tree.  Without  loss  of  generality,  we  assume  that  Zj  is  the  binary  variable  to  branch  on  at  the  root 
node.  The  left  branch  with  Zj  fixed  at  zero  consists  of  the  following  set 

Bl  :=  {(x,  z)  :  ^2  <  k,  ajx  >  1,  (x,  z)  G  Wj^p}, 

where  A^lp  represents  the  set  Amip  with  the  constraint  aj x  +  Zj  >  1  dropped  and  the  variable  Zj  removed 
from  the  formulation.  The  right  branch  with  Zj  fixed  at  one  consists  of  the  following  set 

Br  :=  {(x,z)  :  ^Zi  <  k-  l,  ajx  >  0 ,(x,z)  G  A^jp}, 

which  is  the  union  of  the  following  two  sets: 

B>  :=  {{x,z)  :Y^Zi<k-l,aJx>l,  (x,z)  G  A^IP} 
i-Mi 

and 

B<  :=  {(x,z)  :  J2zi  <  k-  l,ajx  <  1  ,{x,z)  G  A^jp}. 
iMj 

Note  that  BR  is  in  fact  a  restriction  of  BL  and  hence  a  overlap  between  the  left  and  right  branches.  Re¬ 
exploring  BR  in  the  right  branch  is  a  redundancy  which  could  also  hinder  the  infeasibility-based  pruning: 
When  Br  is  infeasible  but  BR  is  feasible,  the  overall  right  branch  will  be  treated  as  a  feasible  node  that, 
otherwise,  would  have  been  pruned.  We  can  safely  take  BR  out  of  the  right  branch  and  the  remaining  search 
tree  will  still  cover  the  whole  solution  space.  This  logic  applies  to  any  node  with  a  zt  fixed  at  one. 

One  way  to  remove  the  overlap  from  the  search  tree  is  to  introduce  extra  constraints  and  use  a  big-M 
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formulation  to  model  the  dichotomy  of  ZjS: 

min  cTx 

aj  x  +  Zi  >  1  i=l,...,m 

aj x  +  Mzi  <  1  +  M  i  =  1 , . m 

m 

Zi  - k 

i= 1 

x  £  R" ,  Zi  £  {0, 1}  V*  =  l,...,m. 

With  this  big -M  formulation,  however,  the  number  of  constraints  doubles,  and  an  appropriate  large  number 
M  is  not  obvious.  Instead,  we  remove  the  overlap  during  the  branch-and-bound  process  as  follows:  whenever 
a  branching  variable  Zi  is  fixed  at  one,  we  reverse  the  sign  of  aj x  >  1  and  add  it  as  a  local  cut  to  this  node. 
The  addition  of  these  local  cuts  keeps  the  search  tree  compact  and  could  improve  infeasibility-based  node 
pruning.  Another  example  exploring  similar  infeasiblity-based  pruning  can  be  found  in  [4] . 


6  Computational  Experiments 

In  this  section,  we  examine  the  potential  impact  of  the  proposed  MIP  approaches  in  solving  two  classes  of 
problems  with  the  CKVLP  structure,  i.e.  MIPs  of  the  form  of  (4).  We  implement  the  algorithms  using 
CPLEX  callable  libraries  (version  12.1),  run  the  programs  on  Intel  Xeon  2.27  GHz  dual  core  Linux  servers 
installed  with  4  Gb  RAM,  and  compare  the  performance  against  the  CPLEX  MIP  solver  with  default  settings. 

6.1  Implementation  Details 

The  implementation  of  the  coefficient  strengthening  technique  (described  in  Section  3)  straightforwardly 
follows  Algorithm  1.  Notice  that,  we  could  obtain  a  tighter  by  enforcing  integrality  constraints  on  some 
binary  variables  in  X(C,  but  the  series  of  minimization  problems  in  Algorithm  1  would  become  more  time- 
consuming.  We  keep  X(t)  in  Algorithm  1  as  the  set  in  Definition  1.  The  threshold  parameter  A  is  chosen 
to  be  0.001. 

In  the  implementation  of  the  mixing  set  inequality  procedure  (described  in  Section  4),  we  add  cuts  only 
at  the  root  nodes  of  search  trees.  We  first  solve  the  root  node  LP  relaxation  and  obtain  an  optimal  solution 
(x,  z).  Next  we  select  the  vector  a  from  the  following  two  sets: 

•  those  constraint  vectors  cq’s  for  which  aj x  <  1;  and 

•  the  cost  vector  c,  if  all  cqs  have  been  used  as  a. 

Then  we  build  a  mixing  set  Y (a)  as  described  in  Section  4.  Other  than  the  most  violated  mixing  inequality 
from  (36),  we  also  add  violated  inequalities  (36)  with  |T|  =2  and  t\  =  1  to  the  root-node  LP  relaxation 
and  solve  it.  The  choice  of  these  inequalities  is  based  on  recommendations  in  [14].  We  iterate  this  process 
until  one  of  the  following  stopping  criteria  is  reached:  (1)  no  cut  with  a  violation  of  more  than  0.00001  is 
identified,  (2)  the  solution  time  exceeds  10,000  seconds,  or  (3)  the  cut  generation  procedure  has  run  for  1000 
iterations.  To  obtain  the  most  violated  mixing  inequality,  we  implemented  the  separation  algorithm  in  [2]. 
At  the  end  of  the  cut  generation  phase,  we  keep  only  the  tight  cuts  in  the  final  model  that  is  passed  on  to 
the  branch-and-bound  phase. 

In  the  implementation  of  the  branching  rule,  we  add  aj x  <  1  as  a  local  cut  to  the  nodes  in  which  zt  is 
fixed  at  one. 


15 


DISTRIBUTION  A:  Distribution  approved  for  public  release. 


6.2  Probabilistic  Portfolio  Optimization 

The  first  class  of  instances  we  test  are  from  the  probabilistically-constrained  portfolio  optimization  model  (2) 
introduced  in  Section  1.  This  problem  can  be  approximated  by  the  sample  approximation  approach  as  in  (3) 
and  reformulated  as  the  following  MIP  [16]: 


min  cTx 
s.t.  eTx  =  1 

aj x  +  rzi  >  r,  Vi  =  1, to 

m 

^2zi<k, 

i=i 

x  €  R" ,  Zi  £  {0, 1}  Vi  =  l,...,n. 

where  is  the  i- th  sample  drawn  from  the  distribution  of  di  and  k  =  [_ to  x  ej .  The  k- violation  substructure 
in  this  formulation  enforces  that  the  number  of  sampled  scenarios  in  which  the  overall  return  is  not  achieved 
must  not  exceed  [m  x  ej.  Hence,  ^ .  the  frequency,  approximates  the  risk  level  e.  The  constraint  eTx  =  1  is 
the  budget  constraint  obtained  by  scaling  the  investment  levels  to  a  unit  budget.  We  also  considered  instances 
where  there  is  no  budget  constraint. 

Each  component  of  is  drawn  from  an  independent  uniform  distribution  between  0.8  and  1.5,  which,  in 
this  context,  represents  the  range  between  a  20%  loss  on  one’s  investment  and  a  50%  profit.  The  required 
return  r  is  chosen  to  be  1.1,  and  e  is  set  at  0.05,  indicating  a  ten  percent  average  return  with  a  probability  of 
95%.  We  set  n  =  20,  to  =  200,  and  k  =  15,  allowing,  at  most,  15  of  200  linear  inequalities  to  be  violated.  The 
cost  coefficients  in  the  model  with  a  budget  constraint  take  on  integer  values  uniformly  distributed  between 
1  and  100.  For  the  model  without  the  budget  constraint,  we  use  the  vector  with  all  components  equal  to  one 
as  the  cost  vector,  since  the  instances  with  this  particular  cost  vector  are  especially  difficult  to  solve.  We 
select  ten  randomly  generated  instances  for  each  model  that  can  be  solved  by  CPLEX  within  ten  hours,  and 
compare  the  proposed  methods  against  CPLEX  with  default  settings. 

Tables  2  and  3  present  the  computational  results  for  the  model  with  a  budget  constraint  and  the  model 
without  a  budget  constraint,  respectively.  The  first  column  gives  the  instance  number.  The  second  and  third 
columns  give  the  branch-and-bound  (B&B)  time  (in  seconds)  and  nodes  of  the  CPLEX  MIP  solver  (CPX). 
Columns  4-6  give  the  root  node  gap  closed  by  the  cuts  generated  by  CPX,  the  coefficient  strengthening  (CS), 
and  the  mixing  set  inequalities  (MIX),  respectively.  Finally,  columns  7-9  and  10-12  compare  the  percentage 
improvements  of  the  three  schemes:  the  branching  rule  (BR),  CS,  and  MIX,  over  the  CPLEX  MIP  solver  on 
branch-and-bound  time  and  nodes,  respectively.  The  percentage  improvement  in  time  for  BR  is  computed  as 
100  x  (Time(CPX)  -  Time(BR))/Time(CPX),  where  Time(CPX)  is  the  branch-and-bound  time  for  default 
CPLEX  and  Time(BR)  is  the  branch-and-bound  time  using  the  proposed  branching  rule.  The  percentage 
improvements  for  the  other  two  schemes,  and  the  nodes  saved  are  computed  analogously. 

The  reported  solution  times  are  only  for  the  branch-and-bound  phase  of  the  overall  procedure.  The  mixing 
set  cutting  plane  algorithm  spends  20  to  30  seconds  on  root  node  until  no  more  cuts  can  be  separated.  The 
time  spent  on  coefficient  strengthening,  which  amounts  to  solving  a  series  of  linear  programming  problems,  is 
under  20  seconds.  The  local  cuts  added  in  the  branching  scheme  can  be  obtained  instantly  by  simply  reversing 
the  sign  of  the  corresponding  constraint.  Since  the  preprocessing  times  in  these  instances  are  negligible  in 
comparison  with  the  branch-and-bound  times,  we  do  not  include  them  in  the  solution  time. 
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Table  2:  Percentage  Improvements  Over  CPLEX  (Portfolio  Optimization  with  Budget  Constraint) 


Instance 

Number 

CPX  Default 

B&B  time  B&B  nodes 

Root  Gap  Closed 
CPX  CS  MIX 

B&B  Time  Saved 
BR  CS  MIX 

B&B  Nodes  Saved 
BR  CS  MIX 

1 

11,094 

13,640,260 

19% 

24% 

25% 

91% 

89% 

60% 

89% 

85% 

62% 

2 

968 

1,657,606 

19% 

23% 

24% 

75% 

82% 

51% 

70% 

79% 

54% 

3 

223 

505,037 

36% 

36% 

41% 

77% 

86% 

25% 

77% 

86% 

36% 

4 

19,830 

25,651,409 

7% 

10% 

13% 

90% 

91% 

79% 

88% 

89% 

80% 

5 

400 

786,701 

29% 

31% 

33% 

70% 

69% 

-4% 

66% 

62% 

-3% 

6 

5,044 

9,786,835 

14% 

19% 

22% 

68% 

82% 

27% 

66% 

80% 

37% 

7 

10,923 

14,365,495 

29% 

33% 

35% 

82% 

88% 

26% 

80% 

85% 

35% 

8 

1,822 

3,177,889 

12% 

18% 

21% 

87% 

90% 

20% 

86% 

88% 

35% 

9 

2,115 

3,454,682 

17% 

24% 

28% 

71% 

69% 

-17% 

66% 

65% 

-2% 

10 

13,017 

10,526,548 

15% 

18% 

21% 

75% 

88% 

11% 

79% 

76% 

-15% 

Average 

6,544 

8,355,246 

20% 

24% 

26% 

79% 

83% 

28% 

77% 

80% 

32% 

Table  3:  Percentage  Improvements  Over  CPLEX  (Portfolio  Optimization  Without  Budget  Constraint) 


Instance 

Number 

CPX  Default 

B&B  Time  B&B  Nodes 

Root  Gap  Closed 
CPX  CS  MIX 

B&B  Time  Saved 
BR  CS  MIX 

B&B  Node  Saved 
BR  CS  MIX 

1 

11,903 

23,786,322 

2% 

64% 

57% 

-15% 

85% 

-176% 

22% 

91% 

85% 

2 

14,584 

24,366,521 

4% 

66% 

61% 

29% 

95% 

65% 

38% 

95% 

89% 

3 

8,730 

17,586,672 

2% 

64% 

58% 

14% 

92% 

-181% 

19% 

93% 

88% 

4 

5,516 

10,898,121 

5% 

64% 

59% 

7% 

90% 

51% 

19% 

91% 

83% 

5 

12,462 

18,021,273 

4% 

66% 

62% 

19% 

95% 

65% 

19% 

94% 

89% 

6 

21,475 

30,948,921 

2% 

64% 

58% 

58% 

92% 

65% 

46% 

93% 

84% 

7 

6,928 

14,634,688 

2% 

64% 

60% 

-27% 

86% 

44% 

17% 

88% 

80% 

8 

15,547 

20,957,656 

2% 

65% 

61% 

42% 

93% 

68% 

33% 

94% 

89% 

9 

34,512 

68,752,624 

2% 

64% 

55% 

41% 

89% 

63% 

50% 

94% 

84% 

10 

5,314 

9,376,843 

2% 

65% 

60% 

-2% 

94% 

69% 

14% 

94% 

88% 

Average 

13,697 

23,932,964 

3% 

65% 

59% 

17% 

91% 

13% 

28% 

93% 

86% 

From  Tables  2  and  3  we  observe  that  the  mixing  set  inequalities  and  coefficient  strengthening  have  compa¬ 
rable  performance  in  terms  of  closing  root  node  gaps.  They  both  close  more  gap  than  the  CPLEX  default  cuts, 
especially  in  the  model  without  a  budget  constraint.  However,  in  the  brancli-and-bound  process  afterwards, 
the  mixing  set  inequalities  cannot  take  full  advantage  of  the  tighter  lower  bounds  to  reduce  overall  time  and 
nodes.  In  fact,  in  four  of  the  20  instances,  the  mixing  set  inequalities  even  worsen  the  performance.  The 
reason  lies  in  the  difficulty  of  selecting  effective  cuts  to  keep  in  the  model  throughout  the  branch-and-bound 
process.  In  our  experiment,  we  also  try  to  employ  the  CPLEX  cut  pool  to  dynamically  manage  all  the  cuts 
generated  at  root  nodes,  but  we  have  not  been  successful  in  identifying  the  most  useful  cuts. 

The  coefficient  strengthening  technique  closes  gap  amounts  similar  to  those  closed  by  the  mixing  inequal¬ 
ities,  but  the  improvement  in  the  overall  branch-and-bound  process  is  significantly  larger  than  for  the  mixing 
inequalities.  The  coefficient  strengthening  is  able  to  cut  down  the  time  and  nodes  by  an  average  of  over  80%. 
This  achievement  can  be  attributed  to  the  fact  that  the  coefficient  strengthening  tightens  the  lower  bound 
without  introducing  any  extra  variables  or  constraints  at  the  root  node. 

The  branching  rule  performs  remarkably  better  in  the  model  with  the  budget  constraint,  over  70%  savings 
on  nodes  and  time  versus  less  than  30%  savings  in  the  model  without  the  budget  constraint.  This  sizable 
difference  can  be  explained  by  the  presence  of  the  budget  constraint.  The  budget  constraint,  as  one  type 
of  side  constraint,  greatly  reduces  the  feasible  region  of  the  node  problems.  Consequently,  the  feasibility  of 
the  node  problems  that  have  budget  constraints  is  more  sensitive  to  the  addition  of  local  cuts  obtained  by 
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reversing  the  signs  of  the  corresponding  covering  inequalities.  Therefore,  adding  the  local  cuts  to  the  models 
with  budget  constraints  is  more  likely  to  lead  to  infeasible  node  problems,  triggering  the  infeasibility-based 
node  pruning  more  frequently. 

6.3  Optimal  Vaccination  Allocation 

The  second  class  of  test  instances  is  the  optimal  vaccination  allocation  problem  under  uncertainty  addressed 
in  [18].  In  this  application,  a  scarce  vaccine  is  allocated  to  households  in  a  community  to  prevent  an  epidemic 
from  breaking  out  by  restricting  the  post-vaccination  reproductive  number  to  be  strictly  less  than  one.  The 
sample  average  approximation  approach  to  this  problem  yields  a  MIP  formulation  which  has  a  CKVLP 
structure,  plus  some  side  constraints  of  form  YliesXi  =  where  S  is  some  subset  of  the  index  set  of  the 
decision  variables.  A  full  description  of  the  model  is  provided  in  the  Appendix.  We  use  the  same  test  instances 
of  this  problem  as  in  [18].  These  instances  have  302  continuous  variables  and  m  binary  variables  (see  Column 
1  in  Table  4).  The  risk  level  e  is  set  to  0.05,  and  the  value  of  k  can  be  determined  accordingly  by  k  —  [m  x  ej . 

Table  4  compares  the  performance  of  three  schemes  against  the  performance  of  the  CPLEX  MIP  solver. 
The  first  two  columns  describe  the  sizes  of  the  instances.  The  next  three  columns  provide  the  root  node  gaps 
closed  by  the  cuts  generated  by  the  CPLEX  MIP  solver,  the  coefficient  strengthening  procedure,  and  the  mix¬ 
ing  inequalities,  respectively.  Columns  6-7  present  the  time  (in  seconds)  spent  on  coefficient  strengthening  and 
generating  mixing  inequalities  at  the  root  node,  respectively.  Columns  8-11  and  columns  12-15  compare  the 
time  (in  seconds)  and  the  number  of  nodes  in  the  branch-and-bound  phase  by  the  CPLEX  MIP  solver  and  the 
three  proposed  schemes,  respectively.  Table  5  summarizes  the  percentage  improvements  of  the  three  schemes 
over  the  CPLEX  MIP  solver  with  default  settings.  The  percentage  improvements  in  total  time  (root  node 
time  +  branch-and-bound  time)  for  CS  is  computed  as  100  x  (Time(CPX)  -  Time(CS))/Time(CPX),  where 
Time(CPX)  is  the  total  time  for  default  CPLEX  and  Time(CS)  is  the  total  time  using  coefficient  strength¬ 
ening.  The  percentage  improvements  in  the  branch-and-bound  time  (excluding  the  coefficient  strengthening 
time)  and  the  nodes  saved  are  computed  analogously.  The  percentage  improvements  for  MIX  and  BR  are 
computed  similarly. 


Table  5:  Percentage  Improvements  Over  CPLEX  (Optimal  Vaccination  Allocation  Problem) 


Size 

B&B  Node  Saved 

B&B  Time  Saved 

Total  Time  Saved 

m 

k 

CS 

MIX 

BR 

CS 

MIX 

BR 

CS 

MIX 

BR 

250 

12 

43% 

17% 

40% 

100% 

100% 

50% 

-5033% 

-7669% 

50% 

88% 

81% 

80% 

100% 

100% 

33% 

-3220% 

-5237% 

33% 

92% 

95% 

92% 

100% 

100% 

60% 

-2186% 

-3257% 

60% 

99% 

96% 

95% 

100% 

100% 

78% 

-1037% 

-1836% 

78% 

78% 

84% 

78% 

100% 

100% 

50% 

-5191% 

-7379% 

50% 

500 

25 

96% 

88% 

86% 

90% 

86% 

69% 

-1083% 

-4142% 

69% 

95% 

87% 

92% 

91% 

78% 

78% 

-701% 

-2637% 

78% 

96% 

93% 

84% 

93% 

82% 

61% 

-1808% 

-6459% 

61% 

91% 

91% 

82% 

91% 

82% 

50% 

-2251% 

-7785% 

50% 

100% 

100% 

98% 

98% 

96% 

82% 

-913% 

-3833% 

82% 

750 

37 

100% 

99% 

97% 

99% 

99% 

96% 

-24% 

-590% 

96% 

78% 

72% 

-82% 

75% 

74% 

-159% 

-2506% 

-14788% 

-159% 

95% 

91% 

84% 

89% 

83% 

57% 

-1418% 

-8252% 

57% 

97% 

96% 

83% 

97% 

93% 

63% 

-380% 

-3399% 

63% 

95% 

94% 

89% 

94% 

86% 

73% 

-764% 

-6194% 

73% 

1000 

50 

100% 

99% 

99% 

100% 

99% 

97% 

54% 

-58% 

97% 

97% 

96% 

96% 

98% 

96% 

87% 

12% 

-180% 

87% 

99% 

96% 

96% 

99% 

96% 

92% 

30% 

-127% 

92% 

91% 

46% 

67% 

93% 

61% 

47% 

-827% 

-2800% 

47% 

88% 

59% 

54% 

88% 

55% 

7% 

-1498% 

-5689% 

7% 

2000 

100 

99% 

78% 

97% 

99% 

78% 

92% 

93% 

72% 

92% 

99% 

59% 

99% 

99% 

52% 

96% 

97% 

49% 

96% 

100% 

87% 

100% 

99% 

85% 

98% 

96% 

81% 

98% 

98% 

85% 

99% 

99% 

82% 

96% 

91% 

74% 

96% 

100% 

30% 

100% 

100% 

14% 

98% 

98% 

12% 

98% 
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Table  4:  Computational  Results  (Optimal  Vaccination  Allocation  Problem) 


Size 
m  k 

Root  Gap  Closed 
CPX  CS  MIX 

Root  Node  Time 
CS  MIX 

CPX 

CS 

B&B  Time 

MIX  BR 

CPX 

CS 

B&B  Nodes 
MIX 

BR 

250 

12 

62% 

95% 

95% 

103 

155 

2 

0 

0 

1 

81 

46 

67 

49 

61% 

93% 

92% 

100 

160 

3 

0 

0 

2 

673 

81 

129 

134 

64% 

93% 

93% 

114 

168 

5 

0 

0 

2 

1,096 

83 

52 

91 

58% 

92% 

90% 

102 

174 

9 

0 

0 

2 

3,054 

39 

112 

149 

55% 

94% 

94% 

106 

150 

2 

0 

0 

1 

153 

33 

24 

33 

500 

25 

52% 

91% 

89% 

493 

1,776 

42 

4 

6 

13 

7,082 

286 

820 

995 

50% 

93% 

90% 

507 

1,738 

64 

6 

14 

14 

20,166 

1,004 

2,600 

1,551 

55% 

91% 

90% 

532 

1,832 

28 

2 

5 

11 

5,329 

218 

390 

839 

54% 

90% 

90% 

515 

1,731 

22 

2 

4 

11 

3,907 

347 

359 

694 

48% 

94% 

92% 

506 

1,964 

50 

1 

2 

9 

18,822 

38 

26 

458 

750 

37 

30% 

91% 

90% 

1,656 

9,230 

1,340 

7 

16 

59 

206,179 

463 

1,648 

5,276 

44% 

91% 

91% 

1,574 

9,066 

61 

15 

16 

158 

7,124 

1,554 

1,989 

12,944 

49% 

93% 

92% 

1,522 

8,418 

101 

11 

17 

43 

16,857 

804 

1,436 

2,729 

45% 

91% 

90% 

1,106 

8,101 

232 

8 

16 

87 

40,393 

1,057 

1,497 

6,692 

42% 

91% 

90% 

1,116 

8,164 

130 

8 

18 

35 

23,886 

1,198 

1,332 

2,658 

1000 

50 

26% 

90% 

89% 

2,928 

10,162 

6,449 

30 

35 

182 

829,738 

3,142 

4,773 

6,351 

32% 

91% 

89% 

3,107 

10,056 

3,636 

86 

143 

482 

433,591 

11,998 

16,333 

15,915 

29% 

92% 

90% 

3,157 

10,144 

4,546 

28 

166 

376 

348,976 

2,552 

13,239 

15,253 

38% 

91% 

88% 

3,218 

10,012 

350 

25 

137 

185 

28,423 

2,424 

15,406 

9,278 

33% 

91% 

88% 

2,791 

10,109 

176 

21 

79 

163 

15,241 

1,821 

6,310 

7,065 

2000 

100 

16% 

88% 

84% 

10,684 

10,804 

166,074 

843 

36,096 

13,992 

9,978,113 

57,601 

2,237,052 

294,053 

15% 

89% 

85% 

10,259 

11,084 

386,246 

2,740 

185,168 

15,960 

24,523,780 

191,138 

10,109,370 

316,629 

14% 

89% 

86% 

10,570 

11,276 

324,023 

1,779 

49,923 

6,305 

24,463,163 

116,273 

3,297,139 

114,177 

15% 

89% 

85% 

11,082 

11,091 

141,172 

2,023 

24,990 

5,129 

9,032,616 

138,575 

1,350,094 

89,084 

15% 

89% 

84% 

11,257 

11,448 

574,819 

1,889 

493,400 

10,795 

39,500,399 

128,181 

27,842,365 

184,543 

The  results  in  Table  4  and  5  show  the  effectiveness  of  the  coefficient  strengthening  technique  in  both 
closing  root  node  gaps  and  reducing  nodes  and  time  of  the  branch-and-bound  phase.  We  observe  that  the 
performance  of  the  coefficient  strengthening  algorithm  is  significantly  more  consistent  than  the  other  two 
methods  and  exhibits  a  certain  stability.  For  example  when  m  =  1000,  the  branch-and-bound  time  saved 
by  the  branching  scheme  ranges  from  7.4%  to  97.2%;  the  branch-and-bound  time  saved  by  the  mixing  set 
inequalities  ranges  from  55.1%  to  99.5%;  in  contrast,  the  coefficient  strengthening  algorithm  varies  only 
from  88.1%  to  99.5%.  This  consistent  behavior  is  also  observed  for  the  probabilistic  portfolio  optimization 
instances  in  Tables  2  and  3.  The  branching  scheme  has  a  comparable  impact  on  reducing  the  search  tree  size 
to  the  coefficient  strengthening  in  the  vaccination  instances,  especially  for  the  difficult  ones  with  m  =  2000. 
Since  this  model  consists  of  equalities  as  side  constraints,  the  local  cuts  added  by  the  branching  rule  cause 
infeasibility  in  the  node  problems  frequently,  therefore,  effectively  reducing  the  search  tree  size. 

The  performance  improvement  in  the  branch-and-bound  phase  comes  at  the  expense  of  computational 
effort  in  coefficient  strengthening  and  separation  of  mixing  inequalities  at  the  root  node.  Unlike  the  portfolio 
optimization  instances,  this  effort  is  quite  significant  for  the  vaccination  instances  (see  columns  6-7  in  Table 
4).  Each  iteration  of  the  coefficient  strengthening  requires  solving  m  linear  programs  -  for  the  instances  with 
m  =  1000  and  m  =  2000,  several  thousand  linear  programs  need  to  be  solved.  Similarly,  in  generating  the 
mixing  set  inequalities,  m  linear  programs  need  to  be  solved  in  order  to  form  one  mixing  set  for  a  given  a, 
and  there  are  m  possible  choices  for  a.  Accordingly,  the  cut  generation  time  increases  in  the  order  of  m2. 
Comparing  column  8  in  Table  4  and  column  9  in  Table  5,  we  observe  that  significant  effort  on  coefficient 
strengthening  is  not  justifiable  for  instances  that  CPLEX  can  solve  in  under  1500  seconds.  For  example, 
for  the  instances  with  m  =  1000,  the  coefficient  strengthening  technique  takes  around  3000  seconds.  Recall 
that  we  impose  a  time  limit  of  10000  seconds,  so  for  these  instances  coefficient  strengthening  is  run  till  no 
coefficients  can  be  further  tightened.  Considering  the  fact  that  CPLEX  takes  only  one  to  two  hours  to  solve 
these  instances,  running  the  strengthening  procedure  to  termination  is  not  economical.  Similarly,  we  observe 
(by  comparing  column  8  in  Table  4  and  column  10  in  Table  5)  that  the  effort  on  mixing  inequalities  is  not 
justified  for  instances  with  m  <  2000  that  CPLEX  can  solve  within  6500  seconds.  On  the  overall  solution 
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time,  the  branching  rule  has  a  more  consistent  performance  since  it  requires  no  additional  effort  at  the  root 
node.  For  the  larger  size  instances  with  m  =  2000,  it  is  worth  spending  about  three  hours  on  strengthening 
to  reduce  the  branch-and-bound  time  from  days  to  minutes.  The  CPLEX  MIP  solver  takes  one  to  six  days  to 
solve  these  instances  to  optimality,  whereas  the  coefficient  strengthening  reduces  the  overall  effort  to  under 
four  hours. 


7  Concluding  Remarks 

In  this  paper,  we  study  covering-type  fc-violation  linear  programs.  We  show  that  such  problems  are  strongly 
NP-hard,  and  study  empirically  the  computational  difficulty  of  MIP-based  approaches  for  these  problems. 
We  introduce  and  analyze  a  coefficient  strengthening  scheme,  adapt  and  analyze  an  existing  cutting  plane 
technique,  and  present  a  branching  technique  to  improve  the  performance  of  MIP  approaches.  Computational 
experiments  on  two  classes  of  problems  show  that  the  proposed  methods  are  effective  in  significantly  reducing 
running  times.  The  coefficient  strengthening  is  most  effective  for  large  instances  and  reduces  the  solution 
time  and  the  number  of  search  tree  nodes  by  80%  to  98%  in  these  instances.  The  branching  scheme  reduces 
the  size  of  search  trees  by  removing  overlaps  between  branches  and  incurring  infeasiblity-based  node  pruning. 
It  takes  no  effort  to  implement  and  works  most  effectively  on  the  CKVLP  models  with  side  constraints.  The 
mixing  set  cuts  are  capable  of  closing  a  large  percentage  of  root  node  gaps.  However,  the  impact  of  these 
cuts  on  the  branch-and-bound  process  are  mixed.  Perhaps  better  performance  might  be  achieved  by  a  more 
effective  separation  procedure  for  mixing  inequalities.  We  have  also  investigated  the  performance  of  various 
combinations  of  the  three  schemes,  but  the  gains  are  not  significant. 
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Appendix 

Optimal  Vaccination  Allocation  Model  of  [18] 

The  vaccination  allocation  problem  allocate  a  scarce  vaccine  to  households  in  a  community  to  prevent  an 
epidemic  from  breaking  out.  The  epidemic  will  die  out  if  the  post-vaccination  reproductive  number  is  strictly 
less  than  one.  Assume  a  community  has  a  set  F  of  types  of  households  and  each  type  of  household  /  £  F 
consists  of  a  combination  of  person  types  t  £  T,  e.g.,  child,  adult,  or  elderly.  A  vaccination  policy  v  £  V 
is  a  delivery  of  vaccine  to  certain  types  of  persons  in  a  household  f  £  F.  For  example,  a  vaccination  policy 
could  be  a  delivery  of  vaccine  only  to  the  two  children  in  a  household  type  that  consists  of  two  adults  and 
two  children.  The  decision  problem  is  to  determine  an  implementation  of  vaccination  policies  for  each  type 
of  household  in  this  community  with  a  minimal  cost  which  guarantees  that  the  post-vaccination  reproductive 
number  is  strictly  below  one  with  a  high  probability  1  —  e.  We  state  below  the  probabilistically-constrained 
model  in  [18]: 
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mm  : 


E  E  E  VthfXfv 

feF  vev  teT 
s.t.  E  xfv  =  1  V/eF 
vGV 

P{  E  E  afv(u)xfv  <  1}  >  1  -  e 

feFv&v 

0<xfv<l  V/  G  F,  v  G  V, 

where  Xfv  is  the  decision  variable  representing  the  percentage  of  policy  v  to  be  implemented  for  household 
type  />  Vt  is  the  number  of  people  of  type  t  vaccinated  in  policy  v,  hf  is  the  proportion  of  households 
in  the  community  that  are  of  type  /,  and  a/„(w)  is  the  computed  random  parameter  for  impact  of  the 
vaccination  policy  v  for  household  type  /,  which  is  a  function  of  different  random  numbers  following  some 
known  distributions.  For  more  details,  see  [3,  18]. 

After  m  i.i.cl.  samples  are  taken  from  a/„(w)s,  the  above  probabilistically-constrained  problem  can  be 
approximated  by  the  following  MIP,  which  has  a  CKVLP  structure: 


max  :  E  E  E  vthfx'fv  -  E  E  E  Vthf 

fGFvevter  feFvevter 

s-t-  E  x'fv  =  1  V/  G  F 

vev 

E  E  a)vx'fv  +  biZi  ^bi  i=  !)  •••.  m 
feFvev 

m 

E  zi  <  k 

o  <  x'fv  <1  V/  G  F,  V  ev,  Zi  G  {0, 1}  i  =  1, ..,  m, 
where  a^v  is  the  ?'-th  sample  of  afv(ui),  x'^v  =  1  —  Xfv,  bi  =  E/gf  Evev  aE  —  1)  an(i  k  =  [e  x  mj . 
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Abstract 

We  study  a  variant  of  the  classical  transportation  problem  in  which  suppliers  with  limited  capacities 
have  a  choice  of  which  demands  (markets)  to  satisfy.  We  refer  to  this  problem  as  the  transportation 
problem  with  market  choice  (TPMC).  While  the  classical  transportation  problem  is  known  to  be  strongly 
polynomial-time  solvable,  we  show  that  its  market  choice  counterpart  is  strongly  NP-complete.  For  the 
special  case  when  all  potential  demands  are  no  greater  than  two,  we  show  that  the  problem  reduces 
in  polynomial  time  to  minimum  weight  perfect  matching  in  a  general  graph,  and  thus  can  be  solved 
in  polynomial  time.  Next,  we  consider  the  convex  hull  of  solutions  to  the  problem  when  a  cardinality 
constraint  is  introduced  on  the  number  of  rejected  markets.  We  show  that  the  cardinality  constraint  does 
not  introduce  new  fractional  extreme  points  for  the  case  when  TPMC  is  polynomially  solvable.  We  give 
valid  inequalities  and  coefficient  update  schemes  for  general  mixed-integer  sets  that  are  substructures  of 
TPMC.  Finally,  we  give  conditions  under  which  these  inequalities  define  facets,  and  report  our  preliminary 
computational  experiments  with  using  them  in  a  branch-and-cut  algorithm. 

Keywords:  Transportation  problem,  market  choice,  complexity,  cardinality  constraint,  facet 


1  Introduction 

We  consider  a  variant  of  the  classical  transportation  problem  in  which  suppliers  with  limited  capacities  have 
a  choice  of  which  demands  (markets)  to  satisfy.  In  this  problem,  if  a  market  is  selected  its  demand  must  be 
satisfied  fully  through  shipments  from  the  suppliers.  If  a  market  is  rejected,  then  the  corresponding  potential 
revenue  is  lost.  The  objective  is  to  minimize  the  total  cost  of  shipping  and  lost  revenues.  We  refer  to  this 
problem  as  the  transportation  problem  with  market  choice  (TPMC). 

More  formally,  we  are  given  a  set  of  supply  and  demand  nodes  that  form  a  bipartite  graph  &'(Vj  U  V2,  E). 
The  nodes  in  set  Vj  represent  the  supply  nodes,  where  for  i  £  V\,  S;  £  N  represents  the  capacity  of  supplier 
i.  The  nodes  in  set  V2  represent  the  potential  markets,  where  for  j  £  V2,  dj  £  N  represents  the  demand 
of  market  j.  The  edges  between  supply  and  demand  nodes  have  weights  that  represent  shipping  costs  Wij, 
where  (i,j)  £  E.  For  each  j  £  V2,  rj  is  the  revenue  lost  if  the  market  j  is  rejected.  For  a  given  vector  of 
parameters  7 j  for  j  £  S  and  S'  C  5,  we  let  7 (S')  :=  7 j,  throughout  the  paper. 

Let  Xij  be  the  amount  of  demand  of  market  j  satisfied  by  supplier  i  for  (i.  j)  £  E,  and  let  Zj  be  an 
indicator  variable  taking  a  value  1  if  market  j  is  rejected  and  0  otherwise.  A  mixed-integer  programming 
(MIP)  formulation  of  the  problem  is  given  where  the  objective  is  to  minimize  the  transportation  costs  and 
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the  lost  revenues  due  to  unchosen  markets: 


X  WVXV  +  X  o-i 

(la) 

( i,j)£E  jev2 

^  xij  —  dj(  1  Zj) 

Vj  £  V2 

(lb) 

i:(i,j)£E 

^  ^  %ij  — 

Vi  £  Vi 

(lc) 

j:(i,j)£E 

Z  £  {0, 1}I^I 

(Id) 

x  £  Rif1. 

(le) 

We  refer  to  problem  description  (la)-(le)  as  TPMC.  The  first  set  of  constraints  (lb)  is  the  demand  con¬ 
straint.  In  TPMC  either  a  demand  for  a  market  is  fully  satisfied  or  rejected  altogether,  which  necessitates 
the  introduction  of  the  additional  binary  variables.  The  second  set  of  constraints  (lc)  model  the  supply 
restrictions. 

TPMC  is  closely  related  to  the  capacitated  facility  location  (CFL)  problem.  In  CFL,  given  a  set  of 
potential  facilities  j  £  Vi  with  capacities  dj,j  £  Vi  and  customers  i  £  V\  with  demands  Si,i  £  Vi ,  we  would 
like  to  determine  which  facilities  to  open  so  that  the  demand  of  all  customers  can  be  satisfied  from  shipments 
from  the  open  facilities.  A  MIP  formulation  of  CFL  is 


/*  xij  A  djZj 

V  j  £  V2 

(2a) 

XI  Xij  = 

Vi  £  Vi 

(2b) 

j:{i,j)£E 

Z£{  0, 1}I^I 

(2c) 

x  £  Rif1. 

(2d) 

Therefore  one  may  view  the  CFL  problem  as  a  ‘complement’  of  the  TPMC  problem  where  the  constraints 

(lb)  and  (lc)  of  TPMC  change  signs  in  the  constraints  (2a)  and  (2b)  in  CFL  respectively.  Note  that  there  is 
no  straightforward  way  of  ‘complementing’  the  variables  of  TPMC  in  order  to  construct  an  instance  of  CFL 
or  vice  versa.  While  the  CFL  problem  has  been  extensively  studied  with  respect  to  its  complexity,  polyhedral 
structure,  and  approximability  ([1,  8]  and  references  therein),  TPMC  is  less  understood. 

Recently,  approximation  algorithms  and  heuristics  have  been  proposed  for  various  supply  chain  planning 
and  logistics  problems  with  market  choice  [11,  18].  It  is  assumed  that  these  problems  are  uncapacitated  or 
that  they  have  soft  capacities.  A  two-stage  approach  is  utilized  in  solving  these  classes  of  problems  that 
admit  a  facility  location  formulation.  In  the  first  stage,  the  problem  is  to  determine  a  subset  of  markets  and 
reject  the  others.  In  the  second  stage,  the  goal  is  to  minimize  the  production  cost  and  lost  revenues  due  to 
unselected  markets.  In  particular,  for  the  uncapacitated  lot-sizing  problem,  the  facility  location  formulation  is 
used  to  model  the  market  choice  counterpart.  It  is  shown  that  the  LP  relaxation  solution  can  be  rounded  in 
a  way  that  guarantees  a  constant  factor  approximation  algorithm.  However,  this  algorithm  relies  on  scaling 
continuous  variables  up,  so  it  does  not  immediately  generalize  to  our  problem  with  hard  capacity  constraints 

(lc) .  Van  den  Heuvel  et  al.  [25]  consider  a  maximization  version  of  the  same  problem  and  show  that  no 
constant  factor  approximation  algorithm  exists  for  this  version,  unless  P=NP.  The  authors  also  give  several 
polynomially  solvable  special  cases,  and  test  heuristics  for  the  general  case. 

The  rest  of  the  paper  is  organized  as  follows.  In  Section  2  we  explore  the  complexity  of  TPMC.  We 
show  that  while  the  classical  transportation  problem  admits  a  strongly  polynomial  algorithm  [16],  its  market 
choice  counterpart  is  strongly  NP-complete.  We  also  identify  a  polynomially  solvable  case  when  the  demands 
of  all  potential  markets  are  no  more  than  two.  In  Section  3  we  consider  a  version  of  the  problem  with 
a  service  level  constraint  on  the  maximum  number  of  markets  that  can  be  rejected.  We  show  that  for 
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the  case  in  which  the  original  problem  is  polynomial,  its  cardinality-constrained  version  is  also  polynomial. 
Furthermore,  in  this  case,  we  show  that  adding  the  cardinality  constraint  to  the  convex  hull  of  solutions  to 
the  original  problem  does  not  create  any  new  fractional  extreme  points.  In  Section  4  we  present  methods 
for  constructing  valid  inequalities  for  mixed  integer  cover  sets  and  mixed-integer  knapsack  sets  with  variable 
upper  bound  constraints,  which  appear  as  substructures  of  TPMC.  We  show  that  these  methods  are  useful 
for  generating  valid  inequalities  for  TPMC.  We  also  study  the  strength  of  the  proposed  valid  inequalities.  Our 
preliminary  computations,  summarized  in  Section  5,  show  that  there  is  a  reduction  in  the  root  gap  when  our 
valid  inequalities  are  incorporated  to  the  branch-and-cut  algorithm.  However,  we  do  not  give  an  extensive 
computational  study  and  the  heuristic  separation  we  use  needs  significant  improvement. 


2  Complexity 

We  first  show  that  TPMC  is  strongly  NP-hard  in  general. 

Proposition  1.  The  decision  version  of  TPMC  is  NP-complete  even  when: 

1.  Si  =  1  for  all  i  £  Vi,  dj  =  d>  3  for  all  j  £  Vi,  Wij  =  0  for  all  ( i,j )  €  E  and  rj  =  1  for  all  j  £  V 2. 

2.  |  Pi  |  =  1  and  Wij  =  0  for  all  ( i,j )  €  E. 

The  proof  for  Proposition  1  Part  1  is  similar  to  the  proof  of  a  related  result  presented  in  [22].  For 
completeness,  we  provide  its  proof  and  the  proof  of  Part  2  in  the  Appendix.  Because  the  reduction  of  Part  1 
is  from  the  Exact  3-Cover  problem,  which  is  strongly  NP-complete  [10],  we  conclude  that  TPMC  is  strongly 
NP-hard  even  for  the  case  where  all  demands  are  equal  to  three.  In  contrast,  Proposition  2  shows  that  TPMC 
is  polynomially  solvable  when  demands  of  all  markets  do  not  exceed  two. 

Proposition  2.  Suppose  that  dj  <  2  for  all  j  £  V 2.  Then  there  exists  a  polynomial-time  algorithm  to  solve 
TPMC. 


This  result  is  proven  by  a  polynomial  time  reduction  to  a  minimum  weight  perfect  matching  problem  on 
a  general  graph  (provided  in  the  Appendix).  The  key  ideas  of  the  reduction  are  based  on  those  presented 
in  [3] .  This  result  can  also  be  proven  by  a  polynomial  time  reduction  to  the  6-matching  problem  [9] ,  see  also 
Theorem  36.1  in  [23]. 

A  matrix  A  is  said  to  have  the  Edmonds-Johnson  property  if  the  sum  of  the  absolute  values  of  the  entries 
in  any  column  of  A  is  less  than  or  equal  to  2.  Edmonds  and  Johnson  [9]  show  that  the  convex  hull  of  integer 
solutions  to  a  system  Ax  <  6,  where  A  has  this  property  is  given  by  the  so-called  blossom  inequalities.  Note 
that  the  constraint  matrix  defined  by  inequalities  (lb),  (lc),  (le),  and  z  £  r!^21  have  the  Edmonds-Johnson 
property  when  dj  <  2  for  all  )  £  V)-  Hence  adding  the  blossom  inequalities  to  the  original  formulation  is 
enough  to  give  the  convex  hull  of  solutions  to  TPMC  in  this  case.  The  blossom  inequality  for  TPMC  is 


^  '  x  ij  +  /*  '  [dj/2\zj  < 

i£Ui,jGU2'.(i,j)EE  j£U2 


s(Hi)  +  d(H2) 
2 


(3) 


where  U\  C  V),  U2  C  V2  such  that  the  sum  of  total  supply  in  U\  and  total  demand  in  t/2,  s{U\)  +  d(f72),  is 
odd.  The  separation  of  blossom  inequalities  (3)  is  polynomial  [12,  17,  20].  We  propose  other  classes  of  valid 
inequalities  for  the  general  case  in  Section  4. 


3  TPMC  with  a  cardinality  constraint 

An  important  and  natural  constraint  that  one  may  add  to  the  TPMC  problem  is  that  of  a  service  level, 
i.e.,  the  number  of  rejected  markets  is  restricted  to  be  at  most  k.  This  restriction  can  be  modelled  using 
a  cardinality  constraint ,  J2jev2  zo  —  appended  to  (la)-(le).  We  call  the  resulting  problem  cardinality 
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constrained  TPMC  (CCTPMC).  If  we  are  able  to  solve  CCTPMC  in  polynomial-time,  then  we  can  solve 
TPMC  in  polynomial  time  by  solving  CCTPMC  for  all  k  G  {0, . . . ,  |  V2 1 } -  Therefore  by  Proposition  1,  we 
obtain  that  CCTPMC  is  NP-hard  in  general.  In  this  section,  we  examine  the  specific  case  where  we  know 
that  TPMC  admits  a  polynomial-time  algorithm. 

In  light  of  the  proof  of  Proposition  2,  via  the  reduction  to  a  minimum  weight  perfect  matching  problem 
on  a  general  (non-bipartite)  graph  G'  =  (V',  E’),  it  is  possible  to  reduce  CCTPMC  with  dj  <  2  for  all  j  G  V2 
to  a  minimum  weight  perfect  matching  problem  with  a  cardinality  constraint  on  a  subset  of  edges  (specifically 
the  cardinality  constraint  is  applied  only  on  the  edges  (j,f)  €  E'  for  each  j  G  V2;  see  proof  of  Proposition 
2  in  Appendix).  To  the  best  of  our  knowledge,  the  complexity  status  of  minimum  weight  perfect  matching 
problem  on  a  general  graph  with  a  cardinality  constraint  on  a  subset  of  edges  is  open.  This  can  be  seen  by 
observing  that  if  one  can  solve  minimum  weight  perfect  matching  problem  with  a  cardinality  constraint  on 
a  subset  of  edges,  then  one  can  solve  the  exact  perfect  matching  problem;  see  discussion  in  the  last  section 
in  [6].  On  the  other  hand,  we  will  prove  in  this  section  that  CCTPMC  with  dj  <  2  for  all  j  G  V2,  which  is  a 
special  case  of  a  minimum  weight  perfect  matching  problem  with  cardinality  constraint  on  a  specific  subset 
of  edges,  in  fact  admits  a  polynomial-time  algorithm.  Our  approach  will  be  following:  We  will  prove  that  the 
TPMC  polytope  (when  dj  <  2  for  all  j  G  V2)  along  with  the  constraint  J^jeVz  zi  —  &  is  integral.  Therefore  by 
invoking  the  ellipsoid  algorithm  it  is  possible  to  solve  CCTPMC  in  polynomial  time.  This  result  also  allows 
for  solving  CCTPMC  (when  dj  <  2  for  all  j  G  V2)  by  a  Lagrangian  relaxation  approach,  where  we  relax  the 
cardinality  constraint. 

Before  we  proceed,  we  briefly  note  that  the  intersection  of  the  perfect  matching  polytope  with  a  cardinality 
constraint  on  a  strict  subset  of  edges  is  not  always  integral. 

Example  1.  Consider  the  bipartite  graph  G(V 1  U  V2,  E)  with  V\  =  {1,  2,  3},  V2  =  {4,  5,  6},  E  =  {(1, 4),  (1,  5), 
(2, 4),  (2,  5),  (2, 6),  (3,  5),  (3,  6)},  and  the  cardinality  constraint  X\±  +  x2$  <  1.  It  is  straightforward  to  show 
that  xu  =  X15  =  x24  =  x2$  =  0.5,  X26  =  £35  =  0,2:36  =  1  is  a  fractional  extreme  point  of  the  intersection  of 
the  perfect  matching  polytope  with  the  cardinality  constraint. 

Let  X  G  x  {0, 1} I A  2 1  be  the  set  of  feasible  solutions  of  TPMC.  Our  main  result  of  this  section  is 
presented  next. 

Theorem  1.  Let  k  G  Z+  and  k  <  \V2\.  Let  Xk  :=  conv(A  n  {(2 :,z)  G  x  {0,  ljTd  |  J2jey2  zj  <  &})•  If 
dj  <  2  for  all  j  G  V2,  then  Xk  =  conv(X)  f)  {(x,  z )  G  x  [0,  l]^  |  J2jev2  zo  —  &}• 

Corollary  3.  CCTPMC  is  polynomially  solvable  when  dj  <  2  for  all  j  G  V2. 

Observation  1.  Theorem  1  is  a  generalization  of  the  well-known  result,  Matching  Cardinality  Theorem:  Let 
G(V,  E)  be  a  graph  with  n  vertices  and  m  edges.  Let  M  C  Mm  be  the  matching  polytope  and  let  Mk  C  Rm  be  the 
convex  hull  of  incidence  vectors  of  matchings  with  at  least  k  edges.  Then  Mk  =  MC\{x  G  Rm  |  1  xi  >  &}  • 

(See  [23]  for  a  proof.) 

We  construct  a  bipartite  graph  GiV1  U  V2,E)  as  follows:  V1  is  a  set  of  n  vertices  corresponding  to  the 
n  vertices  in  G.  V2  corresponds  to  the  set  of  edges  ofG,  i.e.,  V2  contains  m  vertices.  We  use  ( i,j )  to  refer 
to  the  vertex  in  V2  corresponding  to  the  edge  ( i,j )  in  E.  The  set  of  edges  in  E  are  of  the  form  (i,  ( i,j ))  and 
(j,  (i,  j))  for  every  i.j  G  V  such  that  (i,  7)  G  E.  Now  we  can  construct  (the  feasible  region  of)  an  instance  of 
TPMC  with  respect  to  GfV1  U  V2,  E)  as  follows: 

Q  =  {(2:,  z)  G  R2m  x  Rm  |  +  xMiJ)+2z(iJ)  =  2  V(i,j)  G  I>2  (4) 

xh(iJ)  <  1  Vi  G  V1  (5) 

j:(i,j)£E 

Z(i,j)  e  {°, i}  v(*,i)  €  V2}.  (6) 

We  can  construct  an  instance  of  CCTPMC  by  adding  the  constraint  jjeE  z(i,j)  <  k  (call  this  set  Qk). 
It  is  straightforward  to  verify  that  the  Matching  Cardinality  Theorem  is  equivalent  to  stating  con v(Qk)  = 
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conv(Q)  H  {(a;,  z)  \  Y2(  i  j)^E  zL,j)  —  k}-  Thus,  the  Matching  Cardinality  Theorem  follows  from  Theorem  1 
applied  to  the  bipartite  graph  G. 

Now  note  that  the  graph  G  has  a  very  special  structure.  In  particular,  the  degree  of  every  node  in  the 
second  set  of  vertices  ( V 2)  is  2.  On  the  other  hand,  Theorem  1  holds  for  a  general  instance  of  TPMC  with 
dj  <  2  for  all  j  €  Vi,  i.e.  in  particular  for  instances  corresponding  to  general  bipartite  graphs  where  the 
degree  of  the  vertices  can  be  more  than  2  and  the  value  of  dj  can  be  either  1  or  2.  □ 

To  prove  Theorem  1,  one  approach  could  be  to  appeal  to  the  reduction  to  minimum  weight  perfect 

matching  problem  and  then  use  the  well-known  adjacency  properties  of  the  vertices  of  the  perfect  matching 
polytope.  However,  as  illustrated  in  Example  1,  the  integrality  result  does  not  hold  for  the  perfect  matching 
polytope  on  a  general  graph  with  a  cardinality  constraint  on  any  subset  of  edges.  Therefore  a  generic  approach 
considering  the  perfect  matching  polytope  appears  to  be  less  fruitful.  We  use  an  alternative  approach  to  prove 
this  result.  In  particular,  we  apply  a  technique  similar  to  that  used  in  [2].  Consider  the  following  desirable 
property: 

Definition  1  (Edge  Property).  Let  T  C  x  {0,  l}n  be  some  mixed  integer  set.  We  say  that  T  satisfies  the 
edge  property  if  for  all  (w,  r )  £  Rp+"  such  that  min{u>Tx  +  rT  z  \(x,z)  £  T}  is  bounded  and  has  at  least  two 
optimal  solutions,  (x1,  z1)  and  {x2  ,z2)  where  Xq=i  zj  =  k1 ,  z“j  =  k2  anc^  k1  <  k2  —  2,  then  there  is  an 

optimal  solution  ( x3,z 3)  such  that  Xq=i  zj  =  k 3  and  k 1  <  k3  <  k2 . 

Proposition  4.  Let  T  C  x  {0, 1}"  be  a  mixed  integer  set  such  that  conv(T)  is  a  pointed  polyhedron 

and  let  Tk  :=  con v(T  fl  {(x,z)  £  x  {0, 1}"  |  Xq=i  zj  <  kj).  If  T  satisfies  the  edge  property,  then  Tk  = 
conv(T)  (~l  {(x,  z)  £Rp+x  [0, 1]”|  X)”=i  zj  <  k}. 

Proof.  Assume  by  contradiction  that 

n 

Tk  ^  conv(T)  n  {( x ,  z)  £  R+  x  [0, 1]"|  !><*}. 

i= i 

for  some  k  =  k'  £  {0,1,...,  n}.  By  definition  Tk  =  con  v(T  fl  {{x,z)  £  R+  x  {0, 1}"|  Y^j=\  zi  —  &})  so 
Tk  C  conv(T)  fl  {(x,  z)  £  x  [0, 1]"|  zj  —  k}  holds  for  all  k  £  {0, 1, . . . ,  n}.  By  assumption  we  obtain 

Tk  C  conv(T)  n  {(x,  z)  £  R+  x  [0, 1]”|  Xq=1  Zj  <  k'}.  Since  conv(T)  is  pointed  this  implies  that  there  exists 
a  vertex  (x' ,  z')  of  conv(T)  fl  {(x,  z)  £  x  [0, 1]"|  Y^=i  zj  —  k! }  such  that  ( x' ,  z')  ^  Tk‘ .  Therefore  z'  is 
fractional  and  J2j=i  zj  ~  k'  (if  z'j  <  k' ,  then  this  point  is  also  a  vertex  of  conv(T),  therefore  integral 

and  belonging  to  Tk  -  a  contradiction). 

Since  {x' ,z')  is  not  a  vertex  of  conv(T),  there  exists  ( w,r )  such  that  the  vertex  (a :',z')  is  the  intersection 
of  the  face  defined  by  {(a:,  z)  £  x  [0, 1]"|  Y^j=i  zi  =  k'}  and  an  edge  of  conv(T)  defined  as: 

{(a;,  z)  G  conv(T)  |  tuTx  +  rT z  =  <5},  (7) 

where  6  =  min{wTa:  +  rT z  \  (x,  z)  £  conv(T)}  =  wT x'  +  rT z' .  Let  (a;1,^1)  and  ( x2,z 2)  be  two  feasible 
points  of  T  that  belong  to  the  edge  (7)  such  that  ( x',z ')  is  a  convex  combination  of  (x1,^1)  and  ( x2,z 2). 
Note  that  5  =  wTx'  +  rT z'  =  wT x1  +  rT z1  =  wT x2  +  rTz2.  Hence,  (x1,^1)  and  ( x2,z 2)  are  two  optimal 
solutions  corresponding  to  the  objective  function  (■ w,r ).  Furthermore,  due  to  our  selection  of  S,  2  zj  <- 
k'  <  ^jev2  zj-  The  edge  property  ensures  that  there  exists  an  integral  optimal  solution  ( x3,z 3)  with 
k3  =  J2jev2  zj  ~  k'  such  that  J2jev2  z)  <k3  <  J2jev2  zp  However,  this  implies  that  (x3,  z3)  belongs  to  the 
edge  defined  by  (7).  Thus,  (x3,z3)  must  be  a  convex  combination  of  (x1,^1)  and  ( x2,z 2)  or  equivalently,  we 
must  have  (x3,z3)  =  ( x',z ')  with  z'  integral,  a  contradiction.  □ 

Now,  we  show  how  edge  property  and  Proposition  4  can  be  applied  to  TPMC  with  an  additional  constraint 
that  at  most  k  markets  can  be  rejected.  To  prove  Theorem  1  we  use  Proposition  4.  Similar  to  the  argument 
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in  the  proof  of  Proposition  2,  we  assume  that  all  data  are  integral,  and  that  .s,  =  1  for  all  i  €  V i  without  loss 
of  generality.  It  is  straightforward  to  verify  that  the  polyhedron  X  corresponding  to  the  original  instance 
with  Si  >  1  for  some  i  G  V\  satisfies  the  edge  property  if  and  only  if  X  corresponding  to  the  corresponding 
instance  with  s3  =  1  for  all  i  G  V\  satisfies  the  edge  property.  We  are  now  ready  to  present  the  proof  of 
Theorem  1. 

Proof  of  Theorem  1.  By  hypothesis  dj  <  2  for  all  j  G  Vi.  From  Proposition  4  it  is  sufficient  to  prove  that  the 
edge  property  holds. 

Suppose  that  (a;1,#1)  and  ( x2,z 2)  are  optimal  solutions  to  min{wTx  +  rT z  \  (x,z)  G  X}  and  that  xl  is 
fractional.  Then  we  can  solve  a  simple  transportation  problem  with  the  set  of  demand  nodes  j  such  that 
zj  =  0.  Since  all  data  is  integral,  there  exists  an  optimal  solution  with  integral  flows.  Therefore,  we  may 
assume  that  x1  (and  similarly  x2)  are  integral. 

Claim  1.  Suppose  we  have  two  feasible  solutions  of  X,  namely  (x3,z3)  and  (x4,z4),  such  that 
1 ■  E jev2  zj  =  kl  +  1  and  Ejev2  z4  =  k2  -  1  and 

2.  The  objective  function  value  of  ( x 3,  z3)  is  p  —  S  and  that  of  ( x 4,  z4)  is  p  +  5,  where  p  is  the  objective 
function  value  of  the  solution  (x1,  z1)  and  6  G  R, 

then  the  proof  of  Theorem  1  is  complete. 


Proof.  Since  p  is  the  optimal  objective  function  value,  we  obtain  that  <5  =  0  since  otherwise  the  objective 
function  value  of  either  ( x3,z 3)  or  ( x4,z 4)  is  better  than  that  of  (a;1,^1).  Therefore  ( x3,z 3)  is  an  optimal 
solution  with  k 1  <  Ejev2  zj  <  &2-  Because  edge  property  is  satisfied  by  Proposition  4,  the  proof  of  Theorem 
1  is  complete.  □ 

Given  an  integral  point  (x,  z)  of  X ,  let  S(z)  :=  {j  G  V2  \  Zj  =  0}  be  the  set  of  nodes  in  V2  whose  demands 
are  met.  For  j  G  S(z),  let  Ij(x,z)  =  {i  G  V\  \  Xjj  >  0}  =  {i  G  V\  |  x4j  =  1}  be  the  set  of  suppliers  that  sends 
one  unit  to  j. 

Given  the  optimal  solutions  (x1,  z1)  and  (a;2,  z2),  let  F  :=  ( S(z 1)  \  ^(z2))  U  ( S(z 2)  \  S(z1)),  P  :=  S'(2:1)  fl 
S(z2)  and  R  :=  V2  \  (F  U  P).  For  j  G  P,  observe  that  only  the  set  Ij(x1,z1)  or  the  set  Ij(x2,z2)  is  defined. 
So  for  j  G  F,  we  define  I3  as: 


r  ._  /  j  &  S{zl)\S{z2)  (  . 

I3(x2,z2)  if  j  G  S{z2)  \  S(z4).  W 

As  a  first  step  towards  constructing  (a :3,z3)  and  ( x4,z 4)  required  in  Claim  1,  we  construct  a  bipartite 
(conflict)  graph  G*{U\  UU2,S).  The  set  of  nodes  is  constructed  as  follows: 

1.  If  j  G  5'(^1)  \  *S'(^2)  ,  then  j  G  U\  and  j  is  called  a  full  node.  Let  W\  =  S^z1)  \  S(z2)  be  the  set  of  full 
nodes  of  Pi. 

2.  Similarly,  if  j  G  S(z2)  \  S(z4)  ,  then  j  G  U2  and  j  is  called  a  full  node.  Let  W2  =  S(z2)  \  S(z4)  be  the 
set  of  full  nodes  of  U2. 

3.  If  j  G  S^z1)  fl  S(z2)  and  dj  =  2  then  we  place  two  copies  of  node  j  in  U\  (call  these  j\  and  j2)  and  two 

copies  of  j  in  U2  (call  these  J3  and  jf).  These  nodes  are  called  partial  nodes  of  j.  Each  partial  node  of 

j  is  distinct:  If  /^(a:1,^1)  =  {ti,t2},  then  associate  (WLOG)  t\  with  j\  and  t2  with  j2,  that  is  define 
Ij3  :=  {ti}  and  Ij2  :=  {t2}-  Similarly  if  Ij(x2,z2)  =  {^3,^4},  then  associate  (WLOG)  ts  with  j3  and  O 
with  j4,  that  is  define  Ij3  :=  {t3}  and  Ij4  :=  {G}.  If  j  G  <S'(z1)  fl  S(z2)  and  dj  =  1,  then  we  place  one 
copy  of  node  j  in  U\  (call  this  j\ )  and  one  copy  of  j  in  P2  (call  this  j3).  Similar  to  the  dj  =  2  case 
these  nodes  are  called  partial  nodes  of  j.  If  Ij(x1,z1)  =  {ti}  and  Ij(x2,z2)  =  {t3 },  then  set  Ij1  =  {p} 
and  Ij3  =  {O}.  Let  P  =  P1  U  P2,  where  P1  =  {j  G  P  :  dj  =  1}  and  P2  =  {j  G  P  :  dj  =  2}. 
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Thus  U\  =  W\  U  U  ^Ujepiii1})  and  f°r  eac^  element  a  €  Ui  the  set  Ia  is  well-defined  and 

non-empty.  Similarly,  C/2  =  W2  U  ^UjeP2^3’-?4})  u  (U^epiO3})  aRd  for  each  element  b  G  U2  the  set  h  is 
well-defined  and  non-empty.  Now  we  construct  the  edges  £  as  follows:  For  all  a  G  U\  and  b  G  C/2,  there  is  an 
edge  (a,  b)  £  £  if  and  only  if  a  and  b  have  at  least  one  common  supplier,  i.e., 


Ia  n  lb  7^  0  iff  (a,  b)  G  £. 


(9) 


Let  G'(V',E')  be  a  subgraph  of  G*(U\  U  Ui,£).  Since  the  elements  in  V’  (~l  {W\  U  W2)  correspond  to  unique 
elements  in  V2,  whenever  required  we  will  (with  slight  abuse  of  notation)  treat  V'  fl  ( W\  U  W2)  C  V2. 

Claim  2.  Let  G'(V',  E')  be  a  subgraph  of  G*{U\  U  C/2,  £)  satisfying  the  following  properties: 

1.  There  are  no  edges  in  G*  between  the  nodes  in  V'  and  the  nodes  in  (Ui  U  C/2)  \  V’ . 

2.  For  each  j  G  P1 ,  \V'  D  {ji}|  =  \V'  fl  {j3}|  and  for  each  j  G  P2 ,  \V'  (~l  { ji ,  j2}\  =  \V'  n  {j3,u}\- 

3.  |VFi nh'|  =  \w2nv'\  +  1. 


Now  construct 


=  < 


z- 

3 

1 

0 

1 

1 

1 

1 

1 

0 


if  j  G  V2  \  {V  n  F) 
if  jGV'n  w-l 

if  j  g  v'nw2. 

if  j  G  F,  Zj  =  0,i  G  Ij 

ifj  GP,ji  G  ( U1UU2)\V',i£ljl 

if  j  e  P,j2  e  (C/i  u  XJ2)  \V',ie  lj2 

if  j  G  P,Ji  G  V' ,  %  G  Ij3 

if  j  G  P,j4  £V’,i£  Iu 

otherwise. 


(10) 


(11) 


and 


2* 


= 


if  i  g  y2  \  {V  n  f) 
d  if  j  g  V'  n  w4 
1  if  jeV'nw2. 

1  if  j  G  F,  zj  =  0,  i  G  Ij 
1  if  J  GP,j3G  (U1UU2)\V/,i£lj3 
1  if  J  GP,j4G  (PiUP2)\fo',CG/i4 
1  if  j  G  P,J-|  Gl/',iG  4 
1  if  j  G  P,j2  G  V" ,  i  G  /,2 
0  otherwise. 


(12) 


(13) 


Then  (x3,23)  and  (a:4,24)  are  feasible  solutions  of  X  that  satisfy  the  requirements  of  Claim  1. 


Proof.  1.  We  verify  that  (x3,z3)  is  a  valid  solution  to  X.  A  similar  proof  can  be  given  for  the  validity  of 
(a:4, 24).  Clearly  a:3  and  23  satisfy  the  variable  restrictions.  We  verify  that  the  constraint  j)eE  x%  + 
djZj  =  dj  is  satisfied  for  all  j  G  ^2-  If  j  G  P,  then  z3  =  zj  =  1  and  x3j  =  0  for  all  (i,j)  G  E; 
therefore  the  constraint  is  satisfied.  If  j  G  P,  then  using  the  first  and  last  entry  in  (11),  we  have 
J2i-.(i,j)e.E  xij  +  djZj  =  dj.  If  j  G  P,  then  j  G  V2  \  {V  (~l  F).  Therefore  z3  =  zj  =  0.  Now  it  is 
straightforward  to  verify  that  J2i-.(i,j)eE  x%  =  2  =  f°r  each  3  G  P2  since  \V'n{j1,j2}\  =  |V'n{j3,  j4}\ 
and  by  the  use  of  the  last  five  entries  in  (11).  For  j  G  P1  we  have  J2i-.(i,j)eExij  =  1  =  dj  since 
C  {ji}|  =  \V'  fl  {j3} |  and  by  the  use  of  the  second,  fourth  and  sixth  entries  in  (11). 
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Now  we  verify  that  the  constraint  Ylj-.(i,j)eE  xij  <  1  is  satisfied  for  all  i  G  V\.  Given  i  G  V\,  assume  for 
contradiction  that  x3g  =  x3h  =  1  for  some  g,  h  G  V2  and  g  7/  h.  By  construction  of  ( x 3,  z3),  x\t  =  0  for 
all  j  G  R.  Thus,  g,  h  ^  R.  Moreover  since  j)eE  x%  +  djZj  =  dj  is  satisfied  for  all  j  G  V2,  we  have 
zg  =  zh  =  0.  Now,  there  are  three  cases  to  consider: 

(a)  g,h  G  F.  By  construction  of  x3  we  have  i  G  I g  fl  H, .  Now  if  g  ^  V'  and  h  ^  V' ,  then  by  construction 
of  z3  (first  entry  in  (10))  we  have  zg  =  z3  =  0  =  z%  =  z\  and  thus  g ,  h  G  S'(z1).  Therefore  by  the 
validity  of  (a:1,  z1)  we  have  Ig  n  /ft  =  0.  This  contradicts  i  G  IgC]  Ih ■  Now  consider  the  case  where 
g  G  V'  and  h  G  V' .  Since  i  G  I g  PI  /ft  by  (9)  there  is  an  edge  between  g  and  h  in  G*{U\  U  H2,£). 
Thus  we  may  assume  without  loss  of  generality  that  g  G  V'  (~l  W\  and  h  G  V'  D  W2 ■  However, 
this  implies  that  z3  =  1,  a  contradiction.  Now,  without  loss  of  generality,  assume  that  g  G  V'  and 
h  £  V' .  Since  i  G  Ig  PI  Ih  by  (9)  there  is  an  edge  between  g  and  h  in  G*(U\  U  U2,£)-  On  the 
other  hand,  by  assumption  there  is  no  edge  between  nodes  in  Vr  and  those  not  in  V7,  which  is  the 
required  contradiction. 

(b)  g  G  F  and  h  G  P.  Without  loss  of  generality  we  may  assume  that  g  G  W\.  If  g  G  V' ,  then  z3  =  1, 
a  contradiction.  Therefore,  we  have  g  £  V' .  Thus  zg=z3  =  0.  Therefore  by  validity  of  (a:1,^1) 
we  have  i  £  /^(a;1,^1)  or  equivalently  i  G  Ih(x2,z2).  Without  loss  of  generality  we  may  assume 
that  i  G  Ih3-  Note  that  /13  belongs  to  V'  (by  the  construction  of  a:3  and  the  fact  that  x3h  =  1  and 
i  G  I/, 3 ).  Since  i  G  Ig,  there  exists  an  edge  between  g  and  /13.  However,  since  g  V'  and  /13  G  V' , 
we  get  a  contradiction  to  the  fact  that  there  are  no  edges  between  the  nodes  in  V'  and  the  nodes 
in  (Hi  U  U2)  \  V' . 

(c)  g,h  G  P.  In  this  case  we  may  assume  without  loss  of  generality  that  i  G  Ig(x1,z1)  and  i  G 
Ih(x2,z2).  Therefore  without  loss  of  generality,  we  may  assume  that  i  G  Igi  and  i  G  /ft3.  Since 
xig  ~  xih  =  I  we  have  g\  ^  V'  and  /i3  G  V' .  By  assumption  on  G' ,  this  implies  that  there  is  no 
edge  between  g\  and  ft.3.  On  the  other  hand,  since  i  G  Igi  D Ih3  by  (9)  we  have  an  edge  (<q,  ft.3)  G  £, 
a  contradiction. 

2.  Next  we  verify  that  the  objective  function  value  of  (x3,  z3)  is  p  —  <5  and  that  of  (a:4,  z4)  is  p  +  i5  where 
p  is  the  objective  function  value  of  the  solution  (a:1,^1)  and  8  G  R.  This  result  is  verified  by  showing 
that  (a :3,z3)  and  (a:4,2:4)  are  obtained  by  ‘symmetrically’  updating  demands  from  (a:1,^1)  and  (a ’2,z2) 
respectively.  In  particular,  we  examine  each  demand  node  and  examine  the  cost  of  either  satisfying  it 
or  not  satisfying  it  in  each  solution.  We  consider  the  different  cases  next: 

(a)  j  G  R.  Then  z4  =  z3  =  z j  =  z2  =  1. 

(b)  j  G  V'  (~1  H  j .  Then  zj  =  0  and  z3  =  1.  On  the  other  hand  z2  =  1  and  z4  =  0.  Notice  that  in  each 

solution  where  dj  is  satisfied,  this  is  done  by  using  the  same  set  of  input  nodes  (and  thus  using 
the  same  arcs).  Therefore  the  difference  in  objective  function  value  between  (ar,^1)  and  (a:3,z3) 
due  to  demand  node  j  is  —  wij  +  rj  and  the  difference  in  objective  function  value  between 

the  solutions  (a;2,  z2)  and  (a;4,  z4)  due  to  demand  node  j  is  —  rg. 

(c)  j  G  V'  (~1  W2.  Similar  to  the  above  case  the  difference  in  objective  function  value  between  (a:1,  z1) 

and  (a;3,  z3)  due  to  demand  node  j  is  Wio  ~  ro  an<^  ^4e  difference  in  objective  function  value 

between  (a;2,  z2)  and  (a:4,  z4)  due  to  demand  node  j  is  —  Wii  +  ri- 

(d)  j  G  F\  V' ,  then  zj  =  z3  and  z2  =  z4. 

(e)  j  G  P2  such  that  j\ ,  j2  G  ( U\GU2)\V'  andj3,j4  G  (HlUf72)\H,.  Then  the  demand  dj  is  satisfied 
by  the  nodes  in  Ij(x1,z1)  in  (adp1)  and  (a:3,^3).  Therefore  there  is  no  difference  in  objective 
function  value  between  (a;1,  z1)  and  (a:3,  z3)  with  respect  to  demand  node  j .  Similarly,  the  demand 
dj  is  satisfied  by  the  nodes  in  Ij{ x2,  z 2)  in  (a:2,  z2)  and  (a:4,  z4)  and  there  is  no  difference  in  objective 
function  value  between  (a;2,  z2)  and  (a;4,  z4)  with  respect  to  demand  node  j.  We  can  make  a  similar 
argument  for  j  G  P 1  such  that  ji  G  (Hi  U  U2)  \  V '  and  j'3  G  (Hi  U  H2)  \  V' . 
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(f)  j  G  P2  such  that  j i  G  V',  j2  G  (C/i  U  U2)  \  V' ,  j3  G  (4  U  U2)  \  V',  j4  G  V’  without  loss  of 
generality.  Then  the  demand  dj  is  satisfied  by  the  nodes  in  ( 4  U  Ij2 )  in  (x1^1)  and  by  nodes 
(4  U  4)  in  ( x3,z 3).  Therefore  the  difference  in  objective  function  value  between  (x1,^1)  and 
(x3,23)  with  respect  to  demand  node  dj  is  Tiei  Wio  ~  Thiel  Wir  The  demand  dj  is  satisfied 
by  the  nodes  in  (4  U  Ij4)  in  ( x2,z 2)  and  by  the  nodes  in  (4  U  4)  in  (x4,24).  Therefore  the 
difference  in  objective  function  value  between  (x2,  z2)  and  (x4,  z4)  with  respect  to  demand  node  j 
is  Tier  Wii  ~  Tier  wn-  We  can  m&ke  a  similar  argument  for  the  cases:  ji  G  (4  U  4)  \  V' , 
32  G  V'Js  G  V',  j4  e31(Ui  u  U2)  \  V';  j  1  G  V',  j2  G  (4  U  U2)  \  V',  j3  G  V' ,  j4  G  (4  U  U2)  \  V'  and 
jl  G  (4  U  U2)  \  V ,  J2  G  V7,  J3  G  ([4  U  U2)  \  V' ,  j4  G  y7. 

(g)  J  G  P2  such  that  ji  G  V',  j2  G  V77,  j'3  G  F7,  j'4  G  V'.  Then  the  demand  dj  is  satisfied  by  the 

nodes  in  ( Ij1  U  Ij2)  in  (x1,^1)  and  by  the  nodes  in  (Ij3  U  Ij4)  in  ( x3,z 3).  Therefore,  the  difference 
in  the  objective  function  value  between  (x1,^1)  and  (x3,,z3)  with  respect  to  satisfying  demand 
dj  is  Tie(i ■  u /■  )(wij  +  Wij )  —  Tie(i-  u r  )(wij  +  wij)-  demand  dj  is  satisfied  by  the  nodes 
in  (4  U  Ij4)  in  (x2,z2)  and  by  the  nodes  in  (4  U  4)  in  i.xAizi)-  Therefore,  the  difference  in 
the  objective  function  value  between  (x2,z2)  and  (x4,z4)  with  regards  to  satisfying  demand  dj  is 
-  Thai,..  +  wv)  +  Tie(ij3uij4)(wv  +  wa)-  For  iepl.  we  can  similarly  consider  j4  and 

j3  with  ji  g  y7,  j3  g  y7. 

Therefore,  the  objective  function  value  of  (x3,  z3)  is  p  —  6  and  that  of  (x4,  z4)  is  p  +  S  where  p  is  the 
objective  function  value  of  the  solution  (x1,  z1)  and  (x2,  z2)  and  <5  G  R. 

3.  Finally  we  verify  that  Tjev2  zj  =  k1  +  1  and  Tjev2  zj  =k2-l.  We  prove  this  for  (x3,  z3).  The  proof 
is  similar  for  the  case  of  (x4,  z4).  Observe  that  if  j  G  R,  then  zj  =  z3  =  1.  If  j  G  P,  then  zj  =  z3  =  0. 
If  j  G  F  \  y7,  then  zj  =  z3.  If  j  G  W\  D  V’ ,  then  zj  =  0  and  z3  =  1  and  if  j  G  W2  (~l  V’ ,  then  2:1  =  1 
and  z3  =  0.  Thus  Tjev2  z)  ~  Tjev2  zj  =  W  n  —  \V'  D  W4\  =  —1,  where  the  last  equality  is  by 
assumption  (3)  of  G' .  Thus,  Tjev2  zj  =  k1  +  1. 

□ 

Now  the  proof  of  Theorem  1  is  complete  by  showing  that  a  subgraph  G'(V' ,  E')  of  G*(U4  U  U2l  £)  always 
exists  that  satisfies  the  conditions  of  Claim  2.  In  order  to  prove  this,  we  verify  a  few  results. 

Claim  3.  Connected  components  of  G*  are  paths  or  cycles  of  even  length  and  all  the  cycles  involve  only  full 
nodes. 

Proof.  This  is  evident  from  the  fact  that  G*  is  bipartite  and  degree  of  a  G  (4  U  4)  is  bounded  from  above 
by  |4|-  □ 

We  associate  a  value  Vj  to  each  node  j  G  U4  U  C4  In  particular: 

1.  If  j  G  llj ,  then  Vj  =  1. 

2.  If  j  G  Pi  and  j  is  a  partial  node,  then  v3  =  j . 

3.  If  j  G  U-2  and  j  is  a  partial  node,  then  Vj  =  — 

4.  If  j  G  W2,  then  Vj  =  —  1. 

For  a  subgraph  G(V,  E )  of  G*  we  call  v{V)  =  Tjev  vi  lFe  va^ue  °f  Ike  path. 

Claim  4.  v{U\  U  4)  =  k2  —  k1  >  2. 

Proof.  Tjeux\ju2  vi  =  Tjew,  vj+TjeP2(vn+vh)  +  TjeP'  Vh+Tjew2  vj  +  TjeP^vh+vn)  +  TjeP'  vh  = 

I S(z4)  \  S(z2) I  -  I  <42)  \  54)1  =  \S(z4)\  -  |54)|  =  k 2  -  k\  □ 
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Claim  5.  If  G(V,E)  is  a  cyclic  subgraph  of  G*{U\  U  U2,S),  then  v(V)  =  0. 


Proof.  By  Claim  3,  a  cycle  has  only  full  nodes.  Moreover,  since  a  cycle  is  of  even  length,  it  contains  equal 
number  of  nodes  from  W\  and  W2.  □ 

Note  that  a  partial  node  must  be  a  leaf  node  in  a  path.  Using  this  observation  and  by  some  simple  case 
analysis  the  following  three  claims  can  be  verified. 

Claim  6.  If  G(V,E)  is  a  path  containing  exactly  one  partial  node,  then  v(V)  £  {— ^,  5}. 

Claim  7.  If  G(V,E)  is  a  path  containing  two  partial  nodes,  then  v(V)  =  0. 

Claim  8.  IfG(V,E)  is  a  path  containing  only  full  nodes,  then  v(V)  £  {—1,0,1}. 

For  the  subgraph  G(V,  E),  consider  a  k  €  V\F  such  that  k  =  jt  where  t  £  {1, 2, 3, 4}  and  j  £  P2.  Suppose 
k  =  j-\  or  j 2,  then  we  say  that  a  path  G(V,  E)  is  a  mirror  path  for  j ,  if  V  contains  either  j3  or  j4.  Moreover 
we  call  one  of  j3  or  j4  (whichever  belongs  to  V  or  arbitrarily  select  one  of  these  if  both  belong  to  V)  as  the 
mirror  node.  Similarly  if  k  =  j3  or  j4,  then  we  say  that  a  path  G(V,  E)  is  a  mirror  path  for  j,  if  V  contains 
either  j\  or  j2.  Mirror  node  is  similarly  defined  in  this  case.  For  j  £  P 1  we  consider  k  =  j  1  and  k  =  j3. 
Suppose  k  =  j  1,  then  we  say  that  a  path  G(V,E)  is  a  mirror  path  for  j,  if  V  contains  j3  and  we  call  j3  the 
mirror  node.  Similarly  if  k  =  js,  then  we  say  that  a  path  G(V,  E )  is  a  mirror  path  for  j ,  if  V  contains  j\  and 
we  call  j\  the  mirror  node. 

Algorithm  1  constructs  G'(V',E')  that  satisfies  all  the  properties  of  Claim  2.  We  next  verify  that  Al¬ 
gorithm  1  is  well-defined,  that  is  all  the  steps  can  be  carried  out.  Moreover  we  show  that  the  algorithm 
generates  a  subgraph  G'(V' ,E’)  that  satisfies  the  conditions  of  Claim  2. 


Algorithm  1  Construction  of  G'(V',E') 

Input:  G*(UiL)U2,£). 

Output:  G'(V',E')  that  satisfies  all  conditions  of  Claim  2. 

1.  If  there  exists  a  path  G(V,E )  in  G*{Ui  U  U2,f)  containing  only  full  nodes  with  v(V)  =  1,  then  set 
G’  :=  G.  STOP. 

2.  Tag  all  paths  in  G*(Ui  U  U2,£)  as  ‘unmarked.’ 

3.  Select  a  path  G(V,E)  from  the  set  of  ‘unmarked’  paths  containing  a  partial  node  such  that  v(V)  =  } . 
Tag  this  path  as  ‘marked.’  Note  that  by  Claim  6  and  Claim  7,  V  contains  a  unique  partial  node  j*. 

4.  Select  a  path  from  the  list  of  ‘unmarked’  paths,  such  that  it  is  a  mirror  path  for  j*.  Tag  this  path  as 
‘marked.’ 

5.  There  are  three  cases: 

(a)  The  mirror  path  tagged  as  ‘marked’  in  (4)  contains  a  unique  partial  node  and  its  value  is  } . 

GO  TO  Step  6 

(b)  The  mirror  path  tagged  as  ‘marked’  in  (4)  contains  a  unique  partial  node  and  its  value  is  —  \ . 

GO  TO  Step  3. 

(c)  The  mirror  path  tagged  as  ‘marked’  in  (4)  contains  two  partial  nodes  (then  its  value  is  0): 

One  of  the  partial  nodes  corresponds  to  the  mirror  node.  Set  j*  to  be  the  other  partial  node.  GO 
TO  Step  4. 

6.  Set  G'(V',E')  to  be  disjoint  union  of  the  paths  tagged  as  ‘marked.’  STOP. 
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Claim  9.  Algorithm  1  is  well-defined. 

1.  At  the  beginning  of  Step  (3),  the  total  value  of  all  marked  paths  is  0. 

2.  Let  V  :=  Ug(v,b)  is  marked  before  Step  (3)  V-  Then  \V  n  {h, h}\  =  \Vn  {j3 ,ji}\  for  all  j  G  P 2  and 
\V  (~l  {ji}|  =  \V  (~1  {j3}|  .for  all  j  G  P1. 

3.  Step  (3)  is  well-defined,  that  is  as  long  as  the  algorithm  does  not  terminate,  Step  (3)  can  be  carried  out. 

f.  At  the  end  of  Step  (3),  the  total  value  of  all  marked  paths  is 

5.  Step  (4)  is  well-defined,  that  is  as  long  as  the  algorithm  does  not  terminate,  Step  (4)  can  be  carried  out. 

Proof.  We  prove  Claim  9  by  induction  on  the  iteration  number  (n)  of  the  algorithm  visiting  Step  (5).  When 
n  =  0: 

1.  At  the  beginning  of  Step  (3)  there  are  no  ‘marked’  paths  and  therefore  the  total  value  of  all  marked 
paths  is  0. 

2.  H  =  0. 

3.  By  Step  (1),  we  know  that  there  exists  no  path  containing  only  full  nodes  with  v(V)  =  1.  Moreover  by 
Claim  4  we  have  v(U\  U  Uf)  >  2.  Since  by  Claim  5  all  cycles  have  a  value  of  0,  there  must  exist  at  least 
one  path  with  partial  nodes  with  positive  value.  Since  this  is  only  possible  (Claim  6  and  Claim  7)  if 
there  exists  exactly  one  partial  node  in  the  path,  we  see  that  Step  (3)  is  well-defined. 

4.  At  Step  (3)  one  path  is  marked  which  has  a  value  of  half. 

5.  Since  one  path  is  tagged  as  marked  in  Step  (3),  it  contains  exactly  one  partial  node,  j*  G  P.  Suppose 
that  j*  G  P 2  and  j*  =  j*  for  some  i  G  {1, . . . ,  4}.  Then  there  exists  paths  (at  least  two)  which  contain 
the  other  three  partial  nodes  corresponding  to  j* .  If  j*  G  P 1  then  there  exists  one  path  which  contains 
the  other  partial  node.  Therefore  this  step  is  well-defined. 

Now  for  any  n  G  Z+,  assuming  by  the  induction  hypothesis  that  the  result  is  true  for  n'  =  0, . . . ,  n  —  1: 

1.  Step  (3)  is  arrived  at  via  Step  (5b).  Let  n'  <  n  be  the  last  iteration  when  Step  (3)  is  invoked.  By  the 
induction  hypothesis  the  total  value  of  all  the  marked  paths  at  the  end  of  Step  (3)  in  iteration  n'  is  . 
From  iterations  n'  +  l,...,n  —  1,  the  algorithm  alternates  between  Step  (4)  and  Step  (5c).  The  total 
value  of  all  the  marked  paths  here  is  0.  Finally,  the  value  of  the  last  path  tagged  as  marked  in  Step  (4) 
is  —  |  (since  the  algorithm  invokes  Step  (5b)).  Hence,  the  total  value  of  all  the  marked  paths  is  0  at  the 
beginning  of  Step  (3)  in  iteration  n. 

2.  Let  n'  <  n  be  the  last  iteration  when  Step  (3)  is  invoked.  By  the  induction  hypothesis  \V  D  {ji,  J2}  = 
\V  n  {j'3,  J4} |  for  all  j  G  P2  and  \V  Cl  {ji}|  =  \V  ft  {j3}\  for  all  j  G  P1  where 

V  :=  Ug(T A)  is  marked  before  step  (3)  iteration  n>  V.  From  iterations  n'  +  1, . . . ,  n  -  1,  the  algorithm 
alternates  between  Step  (4)  and  Step  (5c).  Since  in  iteration  n  —  1  at  Step  (4),  we  add  one  path  that 
contains  only  the  mirror  node  to  j*  (the  unique  partial  node  from  the  previous  iteration),  we  arrive  at 
this  result. 

3.  Proof  same  as  that  in  the  case  where  n  =  0. 

4.  The  total  value  of  paths  at  the  end  of  Step  (3)  =  value  of  marked  path  +  total  value  of  previously 
marked  path  =  \  +  0. 
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5.  Step  (4)  is  invoked  after  either  Step  (3)  or  Step  (5c).  In  case  we  arrive  via  Step  (3),  by  the  induction 
hypothesis  \V  n  {ji,j2}|  =  \V  D  {j3,ji}\  for  all  j  G  P2  and  \V  n  { ji } |  =  \V  n  {j3}|  for  all  j  G  P 1 
where  V  :=  U g(v,e)  i?  marked  before  Step  (3)  iteration  „■  V.  Moreover  the  path  marked  in  step  (3)  contains 
exactly  a  unique  partial  node  j*  then,  there  must  exist  an  unmarked  path  containing  a  mirror  node  to 
j*.  In  case  of  we  arrive  via  Step  (5c),  again  the  proof  is  essentially  the  same  by  observing  that  at  the 
start  of  Step  (4),  there  is  a  uniciue  partial  node  j*  that  is  not  paired  with  a  mirror  partial  node. 


□ 


Claim  10.  Algorithm  1  terminates  in  finite  time. 


Proof.  This  is  true  since  there  are  a  finite  number  of  edges  and  at  each  iteration  of  the  algorithm  at  least  one 
unmarked  path  is  tagged  as  marked.  □ 

Claim  11.  Algorithm  1  generates  a  subgraph  G'{V' ,E')  that  satisfies  the  properties  of  Claim  2. 

Proof.  First  observe  that  since  the  output  G'(V',  E')  of  the  algorithm  is  a  disjoint  union  of  paths,  there  exists 
no  edge  between  V'  and  (U\  U  U2)  \  V'  in  8 ,  so  property  1  is  satisfied. 

By  Claim  9,  2.  we  have  \V  (~1  {ji ,  J2 } |  =  \V  fl  {j3,  Ja}\  for  all  j  G  P2  and  \V  ft  {ji}|  =  \V  fl  { j'3 } |  for  all 
7  G  P1  where 

U  V. 

G(V,E)  is  marked  before  Step  (3) 

Therefore,  it  is  easily  verified  that  in  the  last  iteration  before  termination,  a  path  with  a  unique  partial 
node,  which  is  a  mirror  node  to  j* ,  is  marked  in  Step  (4).  This  is  because  before  termination  we  arrive  at 
Step  (5a)  implying  that  the  value  of  the  path  marked  in  Step  (4)  is  Hence  Claim  6  and  Claim  7  imply 
that  there  is  a  unique  partial  node  in  this  path.  Thus,  \V'  fl  {ji,j2}|  =  \V'  H  {j3,  j±}\  for  all  j  G  P 2  and 
\V'  C  {ji}|  =  \V'  fl  {j3 } |  for  all  j  G  P1,  so  property  2  is  satisfied. 

Finally,  since  v(V')  =  1  and  \V'  C  {jx,  j2}\  =  \V'  n  {j3, ji}\  for  all  j  G  P2  and  \V'  C  {ji} |  =  \V'  C  {j3}|  for 
all  j  G  P1  we  have 

vj  +  X  vi  =  L 

jev'nwi  jevnw2 

As  a  result,  \V'  fl  W\\  =  \  V'  D  W2|  +  1,  so  property  3  is  satisfied.  □ 


We  showed  that  the  set  of  solutions  to  TPMC  satisfies  the  edge  property.  Theorem  1  then  follows  from 
Proposition  4. 

Finally  we  ask  the  natural  question:  Does  the  edge  property  hold  for  TPMC  when  there  exist  demands 
that  are  greater  than  2?  The  next  example  illustrates  that  the  edge  property  can  fail  to  hold  even  if  dj  >  2 
for  only  one  j  G  Vi- 


Example  2.  Consider  an  instance  of  TPMC  where  G(V \  U  V2,  E)  is  a  bipartite  graph  with  V\  =  {1,  2, . . . ,  6}, 
fo2  =  {1,2,  3, 4},  E  =  {(1,1),  (2, 2),  (3,  3),  (4,1),  (4,4),  (5,2),  (5,4),  (6,3),  (6,4)},  *  =  1,  *  G  Vu  dj  =2,  j  = 
{1,  2,  3},  c?4  =  3.  For  k  =  2  we  obtain  a  non-integer  extreme  point  ofcoirv{T)r\{(x,  z)  G  x  [o,i]"l  £•=!*;< 
k},  given  by  xn  =  x22  =  2:33  =  *41  =  2;44  =  x52  =  a;54  =  xe3  =  x64  =  zx  =  z2  =  z3  =  z4  =  \.  Therefore, 
Tk  7^  conv(P)  fl  {(&,  z)  G  x  [0, 1]”|  "Y^-x  zo  —  example.  Next  we  show  how  the  conflict  graph 

construction  fails  for  this  example.  In  fact,  it  can  be  shown  that  the  edge  property  is  not  satisfied  in  this 
example  by  using  an  alternative  characterization  defined  in  [2].  Let  Wxx  =  vu22  =  w33  =  W41  =  w 44  =  w< 52  = 
W54  =  Wq3  =  Wq4  =  1  and  rj  =3 ,  j  =  {1,2,3}  and  r4  =  6.  For  k  =  0  the  problem  is  infeasible.  For  k  =  1, 
an  optimal  solution  is  Xu  =  x22  =  x33  =  £41  =  £52  =  £’63  =  Z4  =  1  and  all  other  variables  are  zero,  with 
an  objective  function  value  12.  For  k  =  3,  an  optimal  solution  is  X44  =  X54  =  x^4  =  Zx  =  z2  =  z3  =  1  and 
all  other  variables  are  zero,  with  an  objective  function  value  12.  We  show  that  Algorithm  1  fails  to  find  a 
subgraph  G'(V',  E')  of  G*  ( U\  U  U2,S)  that  satisfies  the  properties  given  in  Claim  2  for  this  example.  We  use 
two  feasible  solutions,  namely  solution  for  k  =  1  and  k  =  3  to  build  the  bipartite  graph  given  in  Figure  1. 
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Note  that  I±  =  {1,4},  I2  =  {2,5},  I3  =  {3,6}  and  I4  =  {4,5,6}.  In  Step  (1)  of  Algorithm  1  we  find  a  path 
with  v[V)  =  1  which  is  1  —  4  —  2  then  the  algorithm  stops.  We  have  V'  =  {1, 4,  2}  and  {U\  U  U2)  \  V1  =  {3}. 
However,  property  1  does  not  hold  since  there  exists  an  edge  between  3  and  4  but  3  €  (U\  U  U2 )  \  V’  and 
4  €  V'.  Hence,  Algorithm  1  fails. 


Ux 


U2 


Figure  1:  Bipartite  Graph  G*(U\  U  U2,£)  for  Example  2 


4  Valid  Inequalities 

In  this  section  we  give  valid  inequalities  for  TPMC  and  study  their  strength.  First,  observe  that  the  variable 
upper  bound  inequalities  (VUB)  for  ( i,j )  £  E 

Xij  <  min{  Si ,  dj  }  ( 1  -  Zj )  (14) 

are  valid  for  X. 

Proposition  5.  Let  I  C  Vt ,  J  C  V2  such  that  d(J)  >  s(Vi  \  I).  The  inequality 

yj  +  y  (min  {d(J)  -  sjVx  \I),dj})  Zj  >  d(J)  -  s(Pi  \  I)  (15) 

ieI,jeJ:(i,j)£E  j£J 


is  valid  for  X . 

Proof.  Given  a  feasible  solution  ( x ,  z)  we  consider  two  cases. 

1.  If  Zji  =  1  for  some  f  €  J  such  that  min{c?(J)  —  s(Vi  \  I),dj>}  =  d(J)  —  s(Ei  \  I),  then  the  feasible 
solution  satisfies  inequality  (15)  because  we  have 

y  xa + y  (min {d{J)  -  s(Pi  \  I),dj})  Zj 
ieI,jeJ:(i,j)£E  j£J 

=  y  Xij+  y  (min  {d(J)  —  s(V}  \  /),  dj})  Zj  +  d(J)  —  s(Vi  \  I) 

iei  jeJ\{j'} 

>  d(J)  —  s(Vi  \  I) 

where  the  last  inequality  holds  because  min{(i(J)  —  .s(V)  \I),dj}  >  0  for  all  j  G  J,  and  all  x  and  z 
variables  are  non-negative. 

2.  If  Zj  =  0  for  all  j  £  J  satisfying  min{d(J)  —  s(Vi  \  I),dj}  =  d(J)  —  s(V{  \  I),  then 

EjGj  (min  {d(J)  -  «(Vi  \  I),  dj})  Zj  =  Y.jzj  dj  Zj .  Moreover,  observe  that  Y,iei,jeJ-.(i,j)eE  xij  +  s(^i  \ 
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I)  is  at  least  as  large  as  the  total  flow  sent  to  the  demand  nodes  in  J  in  the  solution  (x,  z),  i.e., 
Eigjjg  J-.{i,j)eE  XH  +  S(U  \  J)  >  Eje j  d^1  -  zo)-  Therefore  we  have 

Y  Xi*  +  H  (min M  J)  “  s(Vi  \  J)> +  s(Ti  \I)>  E  ^  +  E  dj(l  ~  zj)  =  d(J)> 

iei,jeJ:(i,j)eE  jeJ  jeJ  jeJ 

so  inequality  (15)  is  valid. 

□ 


Next,  we  give  valid  inequalities  for  general  mixed-integer  sets  that  are  substructures  of  TPMC. 

4.1  A  Coefficient  Update  Scheme  for  Mixed-Integer  Covers 

Consider  the  mixed  integer  cover  set  defined  by 

t  +  ^2PjZj>Po  (16) 

ieJ 

t  >  0  (17) 

Zj£  {0,1}  WjGJ,  (18) 

for  given  fij  >  0  for  all  j  G  J  and  (3q  >  0.  We  assume  that  fij  <  (3q  for  all  j  £  J  without  loss  of  generality.  Let 

7i  =  conv(S i).  We  refer  to  inequalities  in  the  form  of  (16)  as  type-I  base  inequalities.  Note  that  inequalities 

(15)  for  TPMC  are  in  the  form  of  (16)  since  we  can  replace  Eie/  jej-(i  j)eE  xij  by  1  an(l  t  >  0.  Therefore, 

(16) -(18)  is  a  relaxation  of  TPMC. 

Proposition  6.  Given  a  type-I  base  inequality  (16)  valid  for  a  mixed-integer  program  (MIP)  with  (17)- 
(18),  let  J  :=  {ji,  J2,  •  •  • ,  jP}  C  J  be  a  minimal  cover,  i.e.,  EjeJ  Pj  >  Po  and  E jeJ\{jk}  Pj  -  Po  for  ad 
k  G  {1, . . .  ,p}.  Let  fijp  >  Pjk  for  all  k  G  {1, . . .  ,p}.  Let  J*  :=  J  U  {j  G  J  :  pj  >  Pjp},  p  =  E jeJ  Pj  ~  Po  and 
Po  ■=  Po  -  (p  -  1  )P-  Then, 

t  +  Y  min  {{Po  ~  £),  Po}  zj  +  Y  min  {hi  Pj]  zo  >  Po  (19) 

isT*  jeJ\J* 

is  a  valid  inequality  for  . 

Proof.  We  first  claim  that  (3j  >  [3  for  all  j  G  J* .  Suppose,  without  loss  of  generality,  that  ftj1  <  (3j2  < 
■  ■  ■  <  Pjp ,  and  recall  that  f3j  >  (3jp  for  all  j  G  J*  \  J.  Assume  by  contradiction  that  /3j1  <  /3  or  equivalently 

/ 3j1  —  (ELi  Pjk  ~  Po)  <  0-  This  is  a  contradiction  to  the  minimality  of  the  cover  J. 

Next  we  claim  that  f30  >  0:  By  the  previous  claim  we  have  j3  <  (3jk  for  k  =  1, . . .  ,p.  Therefore,  we  obtain 

p-i 

Po  =  Po  -  (p  -  1  )P  >  Po  -  Y  Pp  -  °’ 

fe= l 

where  the  last  inequality  follows  from  the  fact  that  J  is  a  minimal  cover. 

Given  a  feasible  solution  (x,  z),  let  J\  =  {j  G  J  :  Zj  =  1}  and  J*  =  {j  G  J*  :  Zj  =  1}.  Consider  the 

following  cases: 

1.  Suppose  that  there  exists  j'  G  J{  such  that  min  {p0,  Pj'  —  /?  j  =  P'0-  Then, 

t  +  Y  min  {(Pi  -  P)i  Po}  zj  +  Y  min  {$)>  Pi]  zi 
>  t+  Y  min  {(Pj  -  PPP'o}  z3  +  Y  min  {/3o.  Pj}  Zj  +Po>  Po, 

jeJ*\{j'}  j£J\J* 
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where  the  last  inequality  follows  from  the  fact  that  all  variables  are  non-negative,  f3j  >  /3  for  all  j  G  J* 
and  /3'0  >  0.  The  proof  for  the  case  where  there  exists  j'  G  J\  \  J*  such  that  min  jE,  /3j>  |  =  f3'0  follows 
similarly. 

2.  Suppose  that  for  all  j  G  J*,  we  have  min  j/30,/3j  —  /?j  =  ftj  —  /3  and  for  all  j  G  (Ji  \  J*)  we  have 
min  | /3q ,  fij  j  =  /3j .  There  are  two  cases  to  consider: 

(a)  Suppose  that  |J*|  <p  —  1.  In  this  case, 

t  +  e  (@j  -  p)zj  +  e  = t  +  e  (fij  -  p)  +  e 

j£J*  ieJf 

=  *+Z>  +  E 

je  Ji\Ji 

>  A>-|W>  A>-(P"1)& 

where  the  first  inequality  follows  because  inequality  (16)  is  valid  and  the  second  inequality  follows 
because  of  our  assumption  |  Jf  \  <  p  —  1. 

(b)  Suppose  that  Jf  |  >  p.  In  this  case, 

t  +  E  _  +  E  =  t  +  E  (&j  ~  ft)  +  E  ft? 

>  E  (A-  -  ft)  >  E(^fc  -  ft) 

JGJ*  fc=l 

p 

=  E  ft?*  -Pp  =  Po-(p-  l)/3- 

fc=i 

The  second  inequality  holds  since  |  J*|  >  p  and  since  /3  <  fij,  <  /372  <  •  •  •  <  /3Jp  <  /?,-  for  j  G  J*  \  J. 

□ 

Given  type-I  base  inequalities  (16)  valid  for  any  MIP  with  f  >  0,  and  2j  G  {0, 1},  j  G  J,  we  can  derive 
a  new  class  of  valid  inequalities  (19).  Similarly,  inequality  (19)  is  in  the  form  of  (16),  so  this  process  can 
be  repeated  by  letting  the  valid  inequality  (19)  be  the  type-I  base  inequality  to  derive  other  classes  of  valid 
inequalities. 

Inequality  (19)  is  related  to  the  weight  inequalities  of  Weismantel  [26]  for  the  0/1  knapsack  polytope. 
Note  that  inequality  (19)  is  valid  when  J*  is  replaced  with  J.  After  complementing  the  z  variables,  we 
can  show  that  inequality  (19)  where  J*  is  replaced  with  J  and  the  condition  fij  <  f3'0  for  all  j  G  J  \  J 
is  satisfied  is  equivalent  to  the  weight  inequalities  for  the  0/1  knapsack  polytope  (ignoring  the  continuous 
term  t).  However,  if  J*  D  J  then  inequality  (19)  with  J*  dominates  inequality  (19)  with  J.  Additionally  if 
J*  =  J  and  there  exists  j  G  J\J  such  that  fij  >  f3'0  then  inequality  (19)  dominates  the  corresponding  weight 
inequality.  Weismantel  also  proposes  weight-reduction  and  extended  weight  inequalities  for  the  0/1  knapsack 
polytope.  In  Example  3  we  show  that  weight-reduction  inequalities  and  inequalities  (19)  are  not  equivalent. 
We  also  show  that  the  extended  weight  inequality  is  dominated  by  the  inequalities  found  using  Proposition 
6  for  this  example. 

Example  3.  Consider  the  type-I  base  inequality 

3z\  +  4,22  +  5z3  +  624  >  6,  (20) 

for  t  =  0.  Next,  we  give  examples  of  inequality  (19)  for  different  choices  of  J. 
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1.  Let  J  =  {1,4}.  Then  J*  =  J  and  ft  =  (3  +  6)  —  6  =  3.  Then  corresponding  inequality  (19)  defined  by 

this  choice  of  J  is  min{4,  3}z2  +  min{5,  3}^3  +  3,24  >  3,  or 

Z2  4“  £3  T  Z4.  P  1.  (21) 

2.  Let  J  =  {2,4}.  Then  J*  =  J  and  /?  =  (4  +  6)  —  6  =  4  Then  corresponding  inequality  (19)  defined  by 

this  choice  of  J  is  min{3,  2}zi  +  min{5,  2}^3  +  2Z4  >  2,  or 

Z\  +  Z3  +  Z4  >  1.  (22) 

3.  Let  J  =  {3,4}.  Then  J*  =  J  and  ft  =  (5  +  6)  —  6  =  5.  Then  corresponding  inequality  (19)  defined  by 
this  choice  of  J  is  min{3,  l}z\  +  min{4,  l}z2  +  Z4  >  1,  or 

Z\  +  Z2  +  Zi  >  1.  (23) 

Inequalities  (21)-(23)  dominate  the  corresponding  weight  inequalities  since  for  all  the  inequalities  there  exists 
j  £  J  \  J  such  that  f3j  >  f)'0 .  Inequality  (23)  cannot  be  obtained  by  weight-reduction  inequalities  in  [26].  On 
the  other  hand,  the  weight-reduction  inequality 

3z\  T  Z3  T  224  P  2, 

cannot  be  obtained  using  Proposition  6.  For  this  example,  the  only  valid  extended  weight  inequality  is 

z\  +  Z2  +  2^3  +  2^4  >  2, 

which  is  dominated  by  the  inequalities  (21)  and  (22). 

4.2  A  Coefficient  Update  Scheme  for  Mixed-Integer  Knapsacks  with  Variable 
Upper  Bounds 

Next,  we  consider  another  substructure  of  TPMC  consisting  of  a  mixed  integer  knapsack  and  variable  upper 
bound  constraints.  We  define  set  S2  as  follows: 


a? + a3zi  -  a° 

(24) 

j&J  jeJ 

tj  <  dj(  1  —  Zj)  Vj  £  J 

(25) 

z  £  {0,  l}|J|,ij  £  k!/1, 

(26) 

for  given  ctj  >  0  for  all  j  £  J  and  ao  >  0. 

Let  7~2  =  conv(S2 )•  We  refer  to  inequalities  in  the  form  of  (24)  as  type-II  base  inequalities.  If  we  replace 
tj  :=  Eiei-aneE^P  U  then  the  sum  of  relaxation  of  the  supply  constraints  (lc)  over  /  is  in  the  form 
of  (24)  (with  aj  =  0  for  all  j  £  J)  for  TPMC,  and  (25)  is  a  relaxation  of  the  demand  constraints  (lb).  In 
this  case,  we  observe  that  TPMC  contains  the  fixed-charge  network  flow  substructure.  Therefore,  the  lifted 
flow  cover  and  pack  inequalities  [4,  5,  13,  21,  24],  and  submodular  inequalities  [1,  27]  are  all  valid  for  TPMC. 
Furthermore,  these  inequalities  and  the  blossom  inequalities  (3)  are  in  the  form  of  (24).  Next  we  describe 
valid  inequalities  for  the  set  S2- 

Proposition  7.  Given  the  mixed-integer  set  S2,  let  J  =  {j±,  j2,  ■  ■  ■ ,  ju}  C  J  such  that  dj1  —  >  dj2  — 

otj2  >  •••  >  dju  —  cxju  and  there  exists  m  =  max{Z  £  {0,  1}  :  Y^k-i^jk  +  ]Cfc=;+i  ajk  <ao~ 

X)ieJ\jmax{dj,ai}}.  Let  M  =  {ji,  ji,  ■  ■  ■ ,  jm}  (M  =  0  if  m  =  0)  and  a  =  a0  -  J\J  max  {dj,  aj}  - 
d(M )  —  a(J  \  M).  Then  the  inequality  given  by 

Aj  +  +  a)zi  +  aizi  <a0  +  (u-m-l)a  (27) 

j£J  j£J\J 

is  valid  for  S2  • 
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Proof.  Given  a  feasible  solution  (t,z)  to  <S2,  let  J\  =  {j  £  J  :  Zj  =  1}  and  Jo  =  {j  G  J  :  Zj  =  0}.  Consider 
the  following  cases: 

1.  Suppose  that  u  —  m  —  1  >  |  Ji|.  In  this  case, 


^  +  XltAj  "l"  a)zj  +  X!  'A  A  ~~  X!  tj  +  JZ  A'  +  A  A  +  kil( 


leJ  j&J 


jeJ\J  jeJ\Ji  jeJi  j£J\J 

<  Q!o  +  |  J\  \  cX 

<  ao  +  (u  —  m  —  l)a. 


2.  Suppose  that  u  —  m  <  |  Ji|,  or  equivalently  m>  u  ~  \  J\\  =  \  Jq\.  Then, 

X/'  '  X(A  +  '’kb  •  X  AA 

J'eJ  jeJ  j&J\J 

=  X  ti +  X  a +  X  aa +  \  J^\a 

jeJ\Ji  jeJi  jeJ\J 

<  X]  max{dj, ctj}  +  d(Jo)  +  X]  a,,-  +  |«7i|a 
ieJ\J  jeJi 

=  c^o  —  a  —  J( M )  —  cr  ( J  \  Ad )  -)-  d(  Jo )  -)-  oci^Jf)  T  |  Ji|  cr 

=  a0  —  (d(M)  -  a(M))  -  (d(J0)  -  a(J0))  +  (|Ji|  -  l)a, 

where  the  first  inequality  holds  since 


X!  +  X]  AA 

jeJ\h  jeJ\J 

X  *j  +  X  aa )  +  X  ^  -  X  maxjdj,  a,}  +  d(Jo), 


VAAT 


jeJ0  jeA^ 


and  the  second  equality  holds  because  J2jeJ\j  max  {dj,  aj}  =  ao  —  a  —  d{M)  —  a(J  \  M). 
Furthermore,  due  to  the  choice  of  index  m,  0  <  a  <  djm+1  —  otj.1 .  Thus,  we  have 

m 

(m  -\Jo\)a<(m-  \Jo\)(djm+1  -  ajm+1)  <  X  (dA  -  A'J- 

fc=|>/o|+l 


Moreover,  — 


(d(M)  -  a(M))  -  (d(J0)  -  a(J0))  <-  £™=|j0|+1(difc  ~  A’J 


.  Thus  we  have 


«o  +  (l^il  -  l)a  -  [(d(M)  -  a(M))  -  (d(J0)  -  a(Jo))J 
<  ao  +  (|  Ji|  —  l)a  —  (m  —  |  Jo|)a  —  ao  +  (u  —  m  —  l)a, 


completing  the  proof. 


□ 


As  in  Proposition  6,  Proposition  7  can  be  applied  recursively  to  obtain  new  nontrivial  valid  inequalities 
for  TPMC. 

Next  we  give  an  example  illustrating  the  valid  inequalities  introduced  in  this  section. 
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Example  4.  Consider  an  instance  of  TPMC  with  a  complete  bipartite  graph,  V-\  =  {1,2},  V2  =  {1,2,3, 4}, 
s  =  (31,  20)  and  d  =  (11, 19,  8, 13).  A  valid  inequality  for  X  for  this  instance  is 

%2i  4-  X22  4-  £23  +  a; 24  4-  llzi  +  19^2  4-  823  +  1324  >  20,  (28) 

which  corresponds  to  inequality  (15)  with  I  =  {2}  and  J  =  {1,  2,  3, 4}.  Note  that  d(J)  —  s(Vi  \  I)  =  20  >  dj 
for  all  j  €  J. 

Using  (28)  as  the  type-I  base  inequality,  we  apply  the  coefficient  update  in  Proposition  6  and  let  J  =  {1,4}, 
J*  =  {1,2,4}.  Then  pi  +  Pa  =  11  +  13  =  24  and  (P 1  +  Pa)  —  Po  —  24  —  20  =  4  =  p,  and  we  obtain  the 
corresponding  inequality  (19) 

X21  4-  X22  4-  X23  4-  X24  +  72i  4"  1522  4~  823  +  924  >  16,  (29) 


which  is  valid  for  X. 

Using  (29)  as  the  type-I  base  inequality,  we  apply  the  coefficient  update  in  Proposition  6  and  let  J  =  {3, 4}, 
J*  =  {2,  3,4}.  Then  P3  +  Pa  =  8  4-  9  =  17  and  ( P3  +  Pa)  —  /3o  =  17  —  16  =  1  =  /3  and  again  we  obtain  the 
corresponding  inequality  (19) 

X21  4-  X22  4~  £23  4~  2-24  4"  72i  4~  1422  4~  723  4~  824  >  15,  (30) 


which  is  valid  for  X . 

Now,  consider  the  supply  constraint  (lc)  for  supplier  2 


£21  4-  X22  4-  X23  +  x24  <  20.  (31) 

Then  using  (31)  as  the  type-II  base  inequality  with  I  =  {2}  and  J  =  {1,2,3, 4},  we  apply  the  coefficient  update 
in  Proposition  7,  where  we  let  J  =  {2,4}.  Then  ao  —  ^2jeJ\jnaax{dj,aj}  =  cro~ {di+ds)  =  20— (114-8)  =  1. 
However,  all  demand  values  in  set  J  are  greater  than  1  so  m  =  0  and  a  =  ao  —  (d\  +  ^3)  —  «2  —  &4  = 
20  —  (11  4-  8)  —  0  —  0  =  1.  Then  we  obtain  the  corresponding  inequality  (27) 


X21  4-  X22  4-  £23  +  2^24  4“  22  4-  24  <  21, 


(32) 


which  is  valid  for  X . 

4.3  Strength  of  the  Proposed  Inequalities 

Next  we  give  several  facet  conditions  for  inequalities  (15).  Let  V.[  be  the  set  of  markets.  Observe  that  if 
s(V[ )  <  dj  for  some  j  £  Vf  then  the  demand  of  market  j  can  never  be  met  in  any  feasible  solution  to  TPMC. 
Therefore,  we  can  set  Zj  =  1  for  such  markets  and  let  V2  =  {j  £  V.[  :  ,s(V) )  >  dj}.  In  other  words,  we  remove 
the  markets  that  can  never  be  satisfied  from  the  given  set  of  markets.  Therefore,  throughout  we  make  the 
assumption  that 


s(Vi)  >  maxjey2dj.  (33) 

Let  JK  =  {j  £  J  :  dj  <  d(J)  —  s(Vi  \  /)}. 

Theorem  2.  Inequality  (15)  defines  a  nontrivial  facet  of  conv(X)  only  if  the  following  conditions  hold: 

1.  d(J)>s(V1\I). 

2.  There  exists  j  £  J  such  that  dj  >  d(J)  —  s(Vi  \  I). 

3-  s(Pi)  >  d{J)  —  maxjeJ{dj}  +  ma *-jev2\j{dj} ■ 

4.  If  s(V 1)  <  d(J)  and  I  0,  then  |J<|  >  2  and  the  sum  of  the  smallest  two  demands  in  set  J<  is  not 
greater  than  d(J)  —  s(Vi  \  I). 
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5.  Vi. 

6.  If  \  J\  =  1,  then  \Vi  \  I\  =  1. 

I-  s(V\)  >  d(J\J<) +maxj(zj<{dj}. 

8.  If  s{\ i)  =  d{J)  and  dj  >  d(J)  —  s(Vi  \  I)  for  all  j  €  J  then  |/|  <  1. 

In  addition,  if  the  following  conditions  hold,  then  (15)  is  a  facet  of  conv(TPMC): 

9.  s(Vi)  >  d(J)  —  m&XjeJ{dj}  +  ma Xjev2\j{dj}- 

10.  There  exists  J  C  J<  such  that  d(J  \  J)  >  s(Vi  \  I)  and  d(J  \  J')  >  s(Vi  \  I)  where  J'  =  J  U  {fci},,  for 
all  fci  G  J<  \  J- 

11.  s(Vi)  >  ma Xj(zv2dj. 

Proof.  Necessity. 

1.  Assume  that  d(J)  —  s(Vi  \  I)  <  0. 

From  validity  of  inequality  (15)  we  have  d(  J)  —  s  ( V\  \  I)  >  0  and  combined  with  the  assumption  we  get 
d(  J)  —  s(Vi  \  J)  =  0.  The  resulting  inequality  is  implied  by  the  nonnegativity  of  Xij  and  Zj  for  i  £  I, 
j  G  J,  { i,j )  G  E. 

2.  Assume  that  dj  <  d(J)  —  s(Vi  \  I)  for  all  j  G  J.  Under  this  assumption  inequality  (15)  reduces  to 

Z)  + z  djZj  >  d(J)  —  s(Ui  \  I).  (34) 

i<El,jeJ:(i,j)&E  j£J 

We  add  all  the  demand  constraints  (lb)  in  J, 

^  ]  xij  +  'y  'j  djZj  =  d(J).  (35) 

ieVi,jeJ:(i,j)eE  j&J 

When  we  subtract  (35)  from  (34)  we  obtain 

Z  Xij  KsiV^I).  (36) 

iev1\i,jeJ.(i,j)eE 

If  J  C  V2  then  inequality  (36)  is  weaker  than  all  the  supply  inequalities  (lc)  in  V\  \  I  combined,  because 
>  0  for  all  i  G  /, j  G  V2\J,  (i,j)  G  E.  If  J  =  V2  then  inequality  (36)  is  dominated  by  the  supply 
inequalities  5Zjev2-(»  j)eE  xij  —  f°r  all  i  G  V\ \I  unless  |Ul\/|  =  1.  However,  when  J  =  V2,  |Ul\/|  =  1 
and  dj  <  d(J)  —  s(V_ i  \  I)  for  all  j  £  J  inequality  (15)  reduces  to  a  trivial  facet. 

3.  Assume  that  s(Vi)  <  d( J)  —  ma Xjej{dj}  +  max^g^^jldj}.  Because  we  have  showed  that  there  exists 
j  £  J  such  that  dj  >  d(J)  —  s(Vi  \  I )  we  can  conclude  that  s(V i  \I)  >  d(J)  —  dj  >  d(J)  —  maxjeJ{dj}. 
Note  that  we  have  to  have  s(I)  <  ma Xj6y2\j{dj}  for  s(Vi)  <  d(J)— maxj^j{dj}+maxjev2\ j{dj}  to  hold 
because  if  s(/)  >  ma ^jev2\j{dj},  then  s(Vi)  =  s(Vi  \  J)  +  s(I)  >  d(J)  —  ma XjGj{dj}  +  maxjey2\j{dj} 
which  would  contradict  our  assumption.  Let  r*  =  argma ^-jev2\j{dj}-  Because  (15)  is  a  non-trivial 
facet,  it  is  different  from  zr *  <  1  and  there  exists  solutions  on  the  face  defined  by  (15)  with  zr *  =  0. 
Note  that  J2j&j\j<  Zj  <  1  for  any  point  to  be  on  the  face  defined  by  inequality  (15).  We  consider  the 
following  cases: 
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(a)  Ej-e  J\J<  Z3  =  ^  =  Zl  f°r  some  l  G  J\J<. 

In  this  case,  left-hand  side  of  inequality  (15)  reduces  to 

a:d'+  (min  {g?(  J)  —  s(Vi  \  Zj  +  d(J)  —  s(Vi  \  7) 

iei,jeJ.(i,j)EE 

since  l  £  J\  J<,  min{d(J)  —  s(Vi  \  I),di}  =  d(J)  —  s(Vi  \  I).  Thus  to  satisfy  inequality  (15)  at 
equality  we  must  have  E;<=j  jeJ-(i  j)eE  xij  =  0>  zj  =  0  f°r  j  G  <7  \  {/}  and 

y  Xij  =  d(J  \  {/})  <  s(Vi  \  I)  -  ( dr .  -  s(I))  =  s(Vi)  -  dr *  (37) 

where  dr»  —  s(7)  is  the  amount  of  demand  of  market  r*  that  cannot  be  satisfied  by  the  suppliers 
in  set  I.  We  obtain  a  contradiction  because  (37)  implies  that  s(Vi)  >  d{J)  —  di  +  dn  >  d(J)  — 
ma xjej{dj}  +  maxjey2\j{dj},  since  di  <  maxjej{dj}. 

(b)  EJeJ\J<  zi  ~ 

Let  J  =  {j  £  J<  :  Zj  =  1}.  Then  a  point  on  the  face  defined  by  inequality  (15)  satisfies 

Xij +J2d3  =  d(J)  -  s(v i\/). 
i€l,j€J:(i,j)€E  jej 

This  implies  that  E ,:<=/  jeJ-(i  j)ee  xij  =  d(J  \  J)  —  s(Vi  \  7)  >  0  because  otherwise  we  would  not 
have  a  feasible  solution.  Furthermore,  E»evi\j  jeJ\J-(i  j)eE  xij  =  s(Ci  \-0-  Combining  the  results 
we  observe  that  because  s(7)  <  dr*  we  cannot  send  all  the  demand  of  dr»  from  s(I)  so  some  of  the 
supply  from  s(Vi  \  I)  should  be  sent  to  dr *  but  all  the  supply  s(Vi  \  I)  is  sent  to  markets  in  J  \  J. 
We  reach  a  contradiction,  we  cannot  have  zr *  =  0. 

Suppose  that  s(V\ )  <  d(J)  and  7^0,  then  not  all  demand  in  set  J  can  be  met,  hence  E jejzj  —  1- 
Consider  the  following  cases: 

(a)  J<  =  0.  Then  inequality  E jej(d(J)  —  s(Vi  \  I))zj  >  d(J)  —  s(Vi  \  7)  dominates  inequality  (15) 
since  inequality  (15)  has  the  additional  term  E iei  jeJ-U  j)eExij  —  0- 

(b)  |J<|  =  1.  Let  J<  =  {k}.  We  apply  the  coefficient  update  in  Proposition  6  using  inequality  (15) 
as  the  type-I  base  inequality.  Let  J  =  {j,  k}  where  j  £  J\{k}.  Therefore,  /3  =  (3 j  +  dk  —  /3q  = 
d(J)  —  s(Vi  \  7)  +  dk  —  ( d(J )  —  s(Vi  \  7))  =  dk  and  the  corresponding  inequality  (19)  is 

Xij  +  (d(J)  -  s(Vi  \  7)  -  dk)zj  +  (dk  -  dk)zk  >  d(J)  -  s(Vi  \  7)  -  dk.  (38) 

ieI,jeJ:(i,j)EE  jEJ\{k} 

If  we  add  E jejdkZj  >  dk  to  inequality  (38)  we  obtain  (15).  Hence,  (15)  cannot  be  a  facet. 

(c)  |J<|  >  2  and  dj1  +dj2  >  d(J)  —  s(Vi\I)  where  dj1  and  dj2  are  the  two  smallest  demands  in  set  J<. 
We  use  the  coefficient  update  in  Proposition  6  using  inequality  (15)  as  the  type-I  base  inequality. 
Let  J  =  {j\,  j-2}-  Therefore,  (3  =  dj1  +  dj2  —  ( d(J )  —  s(Vi  \  7))  and  the  corresponding  inequality 
(19)  is 

E iEl,jEJ.{i,j)EEXij  +  E jej\j<(2{d(J)  —  s(Vi  \  7))  —  djx  —  dj2)Zj  (39) 

+  Ejej<\{ji,j2}(^  -  (^ii  +  dh  ~  {d(J)  -  s(Hi  \  I))))zj 
+(d(J)  -  s(V i  \  7)  -  dj2)zjl  +  (d(J)  -  s(Hi  \  7)  -  djl)zj2 
>  2(d(J)  —  s(Vi  \  7))  —  djl  —  dj2. 

Because  dj1  and  dj2  are  the  two  smallest  demands  we  have  J*  =  J  in  Proposition  6.  Note  that  if 
we  add  Ejey(4h  +  ~  ~  s(^i  \  I)))zj  —  dji  +  —  (d(J)  —  s(Vi  \  I))  to  inequality  (39) 

we  obtain  (15).  Hence,  (15)  cannot  be  a  facet. 
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5.  Assume  that  /  =  V\.  Then  inequality  (15)  reduces  to 

E  xij  +  E  dizi  -  dV)-  (40) 

i€Vi,jEJ:(i,j)EE  jEJ 

Inequality  (40)  is  a  relaxation  of  the  demand  equalities  (lb)  in  TPMC.  Therefore,  if  /  =  V\  then  all 
points  in  TPMC  are  on  the  face  defined  by  inequality  (15),  therefore  this  inequality  does  not  define  a 
proper  face. 

6.  Suppose  that  J  =  {j},  but  \Vi  \  I\  >  1.  Then  inequality  (15)  is 

E  - s w  \  d))zi  >  di  -  \  J)>  (41) 

iEl:{i,j)EE 


where  dj  >  s(Vi  \  I)  from  facet  condition  1.  Subtracting  the  original  demand  equality  (lb)  for  j  from 
inequality  (41),  we  get 

'y  i  xij  ~  s(^i  \  -0(1 _  zj)i 

which  is  dominated  by  VUB  inequalities  (14)  for  i  £V\  \  I. 

7.  Assume  that  s(Vi)  <  d(J  \  J<)  +  maxje  j<  {dj}.  Then  not  all  demand  for  markets  in  set  J  \  J<  and 
the  largest  demand  in  set  J<  can  be  met  at  the  same  time.  Hence,  EeJ\J<  zi  +  zm  >  1  where 
m  =  argmaxj6j<{d,}.  We  use  Proposition  6  and  inequality  (15)  as  the  type-I  base  inequality.  Let 
J  =  {l,  m}  where  l  £  J\J<  then  /3  =  d(J)  —  s(Pi  \  I)  +  dm  —  ( d(J )  —  s(V i  \  I))  =  dm.  We  obtain 

El  xij  +  El  {d{J)  —  s(Vi  \I)—dm)zj  +X^jej<\{m}  djZj  +  (dm  —  dm)zm  (42) 

iEl,jEJ:(i,j)EE  jEJ\J< 

>  d{j)  -  S(Pr  \  I)  -  dm. 

If  we  add  5Z;ej\j<  dmZj  +dmzm  >  drn  to  inequality  (42)  we  obtain  (15).  Hence,  (15)  cannot  be  a  facet. 

8.  Assume  that  s(Vi)  =  d(J),  dj  >  d(J)  —  s(Pi  \  I)  for  all  j  £  J  and  for  contradiction  |/|  >  2.  Because  of 
assumption  s(Vi)  =  d(J)  we  have  dj  >  d(J)  —  s(Vi  \  I)  —  s(Vi)  —  s(Vi  \  I)  =  s(I)  for  all  j  £  J.  Under 
these  assumptions  inequality  (15)  reduces  to  XE/  j)EE  xij+'52jej  s(I)zj  —  S(I)-  Let  I'  =  I\ {*'} 

and  I  =  {?'}  where  i'  £  I  (/'  ^  0  and  /  ^  0  because  |/|  >  2  by  assumption).  Consider  the  following 
inequalities  in  the  form  of  inequality  (15)  with  set  /  replaced  with  sets  I'  and  I" ,  respectively 

E  xij +^2(s{I)  ~  Si')zj  >  s(I)  -  Sjr,  (43) 

iEl\{i'},jEJ:(i,j)EE  jEJ 

E  Xi'i  +  El Si'  zi  -  Si' '  (44) 

jEJ:(i',j)EE  jEJ 

Inequality  (43)  is  valid  because  d(J)  —  s(Vi\I')  =  d(J)  —  s(Vi\I)  —  Si'  =  s(I)  —  Si>  >  0.  Furthermore,  the 
coefficient  of  Zj  is  min{dj,  s(I)  —  s,/}  =  s(I)  —  s#  because  of  the  assumption  dj  >  d(J)  —  s(Vi\I)  =  s(I) 
for  all  j  £  J.  Inequality  (44)  is  valid  because  d(J)  —  s(Ul \I  )  =  s(Ui)  —  s(Ui  \I  )  =  s(I  )  =  s#  >  0  and 
similarly  the  coefficient  of  Zj  is  min{sj/,dj}  =  sy ,  because  dj  >  s(I)  >  Si>  for  all  j  £  J  by  assumption. 
By  adding  inequalities  (43)  and  (44)  we  obtain  inequality  (15)  with  set  I.  Hence,  (15)  cannot  be  a 
facet. 
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Sufficiency.  We  use  the  technique  in  §7.4.3  Theorem  3.6  [19].  We  show  that  inequality  (15),  plus  any  linear 
combination  of  the  demand  constraints  -u  j)eE  xij  +  djzj  =  dj  f°r  all  j  G  V2  is  the  only  inequality  that 

is  satisfied  at  equality  by  all  points  (x,z)  feasible  to  TPMC  that  are  tight  at  (15),  i.e.,  we  show  that  if  all 
points  of  TPMC  at  which  (15)  is  tight  satisfy 

E  avxv  +  E  ^ozi  =  «.  (45) 

(i,j)£E  j€V 2 


then 


1.  otij  =  Uj,  j  G  C2  \  J,  i  G  Vi,  (i,j)  G  E, 

2.  aij  =  Uj,  j  G  J,  i  G  Vi  \  7,  (i,j)  G  E, 

3.  =  a  +  Uj,  j  G  J,  i  G  7,  ( i,j )  G  E, 

4.  ifj  =  Ujdj,  j  &V2\J 

5.  ipj  =  d(min{d(J)  —  s(Vi  \  I),dj})  +  Ujdj,  j  G  J, 

6-  a  =  a  ( d(J )  -  s(Pl  \  /))  +  ujdj- 

In  the  proof  we  consider  three  different  types  of  points  at  which  (15)  is  tight.  These  points  are  solutions 
to  TPMC  but  are  subject  to  additional  systems  of  constraints.  Throughout,  let  e  be  a  very  small  number 
greater  than  zero  unless  noted  otherwise. 

1.  Suppose  that  di  >  d(J)  —  .s(Vj  \  I)  for  l  =  arg maxje j{dj}.  Consider  a  point  where  only  markets 
j  G  {?’}  U  J  \  {/}  are  satisfied  for  some  r  G  V2  \  J  and  constraints 


xij — 0 

y  v  Xij  —  d(T)  di 


i€.Vi:(i,r)€E 


E 

jev2-.(i,j)eE 


Xij  =  0, 

iGD,jG{l}Uk2\(JU{r}) 

Xij  >  e, 

*  G  Vi\I,j  &J\{1} 

Xir  >  e, 

i  G  Cl 

Xij  Si  &i  C , 

i  G  Ci 

zi  =  4, 

j  G  {1}  u  C2  \  ( J  U  {r}) 

II 

0 

j  G  {?’}  U  J  \  {/} 

in  addition  to  the  original  constraints  are  satisfied,  which  we  refer  to  as  System  1.  We  know  that  a 
solution  to  System  1  exists  from  facet  conditions  9  and  11.  For  a  solution  to  be  feasible  to  System  1  the 
demand  of  markets  j  G  { r}  U  J  \  {Z}  have  to  be  met,  i.e.,  s(Vi)  >  d(  J)  —  ma Xjej{dj}  +  ma ’X-jev2\j{dj}- 
Additionally,  we  would  like  to  change  a  given  solution  by  increasing  and  decreasing  the  x  values  by  e 
hence  the  need  for  >  relationship  in  facet  condition  9. 
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2.  Suppose  that  di  >  d(J)  —  s(V-\  \  I)  for  some  l  G  J.  Consider  a  point  where  only  markets  j  G  J  \  {/}  are 
satisfied  and  constraints 

^  xij  =  0 

iei,jeJ-.(i,j)eE 


^  (  Xij  —  d(T)  di 

i£V1\I,j£j:(i,j)£E 


Xij  0, 

*  G  Ci ,  j  G  {/}  U  V2  \  J 

Xij  >  e, 

»GCi\/,j  G  J\{/} 

^  ^  Xij  ' '  Si  e, 

*  G  Ci  \  I 

j£V2:(i,j)&E 

Zj  =  1, 

j  G  {1}  U  C2  \  J 

je  J\{1} 

in  addition  to  the  original  constraints  are  satisfied,  which 

we  refer  to  as  System  2.  We  know  that 

a  solution  to  System  2  exists  from  facet  condition  2  since 
s(Vi)  >  s(Cl  \  I)  >  d(J)  —  dj,  and  from  facet  condition  11. 

there  exists  at  least  one  j  G  J  such  that 

We  define  J  C  J  such  that  d(J\J)  >  s(C  \  I).  Due  to  the  choice  of  J  we  have  dj  <  d(J)  —  s(C  \  I) 
for  all  j  G  J  so  J  C  J<  (if  dj<  >  d(J)  —  s(C  \  I)  and  j'  G  J  then  we  cannot  have  d(  J  \J)>  s(C  \  /)). 

In  this  point,  markets  in  set  J  UV2\  J  are  rejected  and  constraints 

=  d{J  \  J)  -  s(C  \  I) 

i£l,j£J:(i,j)GE 

Y  xij  =  s(Ci  \  I) 

ieV!\i,jeJ:(i,j)eE 

Xij  —  0, 

i  G  Ci,  j  G  JU  C2  \  J 

Xij  >  e, 

i  G  Ci ,  j  G  J  \  J 

^  \  Xij  ^  Si  6, 

i  G  I 

j£j\J:(i,j)eE 

Zj  =  1, 

j  G  J  U  C2  \  J 

II 

0 

j  £  J  \J 

in  addition  to  the  original  constraints  are  satisfied,  which  we  refer  to  as  System  3.  We  consider  a  set  J  such 
that  all  demand  in  set  J\JK  is  satisfied  and  J2ieI  jej-(i  j)&E  xij  >  0-  This  is  possible  due  to  facet  conditions 
7,  11,  and  non-negativity  of  x  variables. 

In  order  to  establish  the  values  of  the  coefficients  atij,  ipj  and  d,  we  construct  a  feasible  solution  to  the 
given  systems  1,  2  and  3.  Then  a  small  change  in  the  solution  is  made.  By  evaluating  (45)  at  both  solutions, 
which  are  on  the  face  defined  by  (15)  and  comparing  the  resulting  expressions,  the  possible  values  of  a  set  of 
coefficients  are  obtained. 

We  start  by  showing  that 

1.  atij  =  Uj,  j  G  C2  \  J,  *  G  Vu  ( i,j )  G  E. 

Consider  any  solution  to  system  1  with  any  market  r  G  C2  \  J  that  is  satisfied.  Choose  arbitrary 
suppliers  i,i'  G  V\  such  that  (i,r),(i',r)  G  E.  Construct  a  new  point  by  decreasing  the  flow  on  edge 
(i,r)  by  e  and  increasing  the  flow  on  edge  (i',r)  by  e.  Note  that  this  point  is  also  on  the  face  defined 
by  inequality  (15).  Thus, 

otij  =  Uj,j  G  C2  \  J,  *  G  Cl,  (i,j)  G  E. 
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2.  atij  =  Uj,  j  €  J,  i  £Vi\I,  ( i,j )  G  A’.  Note  that  if  |Vi  \  7|  =  1,  then  =  Uj,  j  G  J  trivially  holds.  We 
condition  on  the  number  of  markets  in  set  J. 

(a)  J  =  {/c}.  Note  that,  from  facet  condition  6,  we  have  V]  \  /|  =  1,  so  the  result  holds. 

(b)  |J|  >  2.  By  assumption,  \Vi  \  I\  >  1.  Due  to  facet  condition  2  there  exists  k  G  J  such  that 
dk  >  d(J)  —  .s(Vj  \  I).  We  consider  a  solution  to  system  2  with  l  =  k.  Choose  any  market 
j  G  J\  {k},  any  suppliers  i,i'  G  V\  \  I  such  that  ( )  G  E.  Make  an  e-change  of  flow 
between  the  two  suppliers  i,  i'  and  market  j.  Thus, 

an  =  uj,j  G  J\{k},i  G  Vi  \I,  ( i,j )  G  E. 

Next  we  show  that  cmk  =  Uk  for  all  i  G  V\  \  I.  If  there  exists  another  j*  such  that  dj *  > 
d{J)  —  s(Vi  \  I),  j*  ^  k  then  we  consider  a  point  satisfying  System  2  with  l  =  j* ,  and  use 
the  same  argument  as  before  to  show  that  cuk  =  Uk  for  all  i  G  V\  \  I.  If  no  such  j*  exists 
then  dj  <  d(J)  —  s(V \  \  I)  for  all  j  G  J  \  {fc}.  In  this  case  k  is  the  only  market  in  J  with 
dk  >  d(J)  —  s{V\  \  I).  Then  from  facet  condition  7  we  know  that  there  exists  a  solution  to  a 
variant  of  System  3  with  J  C  J<  \  {j}  for  some  j  G  J\{k}  (in  which  we  set  e  =  0  in  case  facet 
condition  7  is  satisfied  at  equality),  where  along  with  market  k  we  can  satisfy  at  least  one  more 
market,  j.  Choose  suppliers  i,  i’  G  Vi  \  I  such  that  ( i ,  k),  (i' ,  k),  ( i,j ),  (i',  j)  G  E.  Decrease  flow  on 
edges  («,  j),  ( i',k )  by  e  and  increase  flow  on  edges  (i,  k),  ( i',j )  by  e.  Note  that  since  we  are  using 
a  solution  to  a  variant  of  system  3  in  which  we  set  e  =  0  inequality  (15)  is  still  tight.  Thus, 

OCik  ( Xjj  k  “b  Otj'j  O^ik  ^ j  i'k  T  ^ik  ^i'k  d. 

Therefore,  =  Uk  for  all  i  &V\\I. 

3.  ctij  =  a  +  Uj,  j  G  J,  *  G  /,  (i,  j)  G  E. 

Consider  a  solution  to  system  3  with  J  C  J<.  Choose  any  market  j  G  J  \  J,  any  two  suppliers  i,i'  G  / 
such  that  (i,  j),  (i' ,  j)  G  E.  Make  an  e-change  of  flow  between  the  two  suppliers  i,  i'  and  market  j. 
Thus, 

=  ®),j  G  J\J,i€  /,  (i,  j)  G  E. 

Let  a]  =  OLj  +  Uj,  j  G  J\  J.  Facet  condition  10  and  definition  of  J  (i.e.  J  C  J<)  implies  that  for 
any  k\  G  J<  we  can  redefine  J  to  either  include  k\  or  not.  More  specifically,  if  ki  G  J  then  market  ki 
is  rejected.  To  show  that  for  all  i  G  I,  (i, /ci)  G  E  we  choose  another  J  such  that  ki  ^  J. 

Using  the  same  argument  as  before  we  obtain  for  all  sGl,  ( i ,  k\)  G  E.  As  a  result,  we  have 

shown  that  cty  =  aj,j  G  J, i  G  /,  (i,j)  G  E.  Next  we  show  that  ay  =  a,  j  G  J\J.  Choose  any  markets 
j,j ’  G  J  \  J,  any  suppliers  i  G  V\\I,  i'  G  I  such  that  ( i',j ),  ( i,j (*'  ,j')  G  E.  Decrease  flow  on 
edges  ( i,j '),  ( i',j )  by  e  and  increase  flow  on  edges  ( i',j ')  by  e.  Thus, 

a jj  OLij!  a %! j  — f—  Oiji j<  —  Uj  ( t j '  /  —  0. 

By  again  using  aj  =  dj  +  Uj  and  aj,  =  dj/  +  Uj / ,  we  obtain 

by  =  ®j'- 

Since  j  and  j'  can  be  chosen  as  any  market  in  J  \  J  we  conclude  that  dj  =  d,  j  G  J  \  J .  Furthermore, 
since  as  before  we  can  rearrange  set  J  to  include  or  not  include  any  k\  G  J<  we  get  dj  =  d,  j  G  J. 

4.  ^j  =  Ujdj,  j  G  V 2  \  J-  We  rewrite  (45)  using  the  information  obtained  until  now  and  get 

d  ^  ayj  +  ^  UjXij  +  i’jZj  = (46) 

ieI,jeJ:(i,j)eE  j€V 2 
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Consider  any  solution  to  system  1  with  any  market  r  £  V2  \J  that  is  satisfied.  Then  we  construct  a  new 
solution  based  on  this  solution  where  we  set  zr  =  1  and  xir  =  0  for  all  i  £  V\,  (i,  r)  £  E  and  all  other 
variables  remain  the  same.  Note  that  this  solution  is  also  on  the  face  defined  by  (15)  since  r  £  V2  \  J 
and  the  new  solution  is  a  solution  to  system  2.  We  compare  face  (45)  evaluated  at  these  two  solutions. 
Thus, 


Ur  Xir  —  1pr  =  0. 

i£Vi:(i,r)£E 

Because  r)e£  xir  =  dr  we  have  ipr  =  urdr. 

5.  ipj  =  d(min{<i(J)  —  s(Vi  \I),dj })  +  Ujdj,  j  £  J. 

We  consider  2  cases. 

(a)  dj>  <  d{J)  —  s(Vi  \  I)  for  some  j'  £  J. 

We  consider  a  solution  to  system  3  with  J  such  that  d(J)+dj>  <  d(  J)  —  s( V\  \  I) .  This  is  a  feasible 
solution  due  to  facet  condition  10  where  k\  =  j' .  We  evaluate  (46)  at  this  solution  and  obtain 

a(d(J  \  J)  —  s(Vi  \  /))  +  ujxij  +  4>j  =  &. 

iev1,jeJ\J-.(i,j)eE  jeJuv2\j 

Then  we  use  the  same  solution  except  now  we  set  zy  =  1,  X0  =  0,  i  £  Vi,  ( i,j' )  £  E  (so  we 
redefine  J  as  J'  =  J  U  {j'})  and  J2iei  jeJ-.(i,j)eE  xij  =  d{J  \  J)  —  s(Vi  \  I)  —  dj>  and  evaluate 
(46)  again.  Note  that  this  solution  is  also  on  the  face  defined  by  (15)  because  we  had  zy  =  0, 
T,iei,jej-.(i,j)eExij  =  d(J\J)  -  s(Vi  \  I)  and  we  changed  it  with  zy  =  1,  i:-r'j  = 

d(J  \  J)  —  s(Vi  \  I)  —  dj>  and  the  coefficient  of  zy  is  dy  in  inequality  (15).  Thus, 

a{d(J  \  J)  —  s(Vi  \I)  —  dy)  +  YlieVi, jeJ\J':(i,j)eE  ujxij 

+  X^eiuv2\J  +  ’’Pj'  = 

Taking  the  difference  between  (46)  evaluated  at  these  two  solutions,  we  obtain 

Ipy  =  adji  +  Uy  Xiji  =  adj’  +  Uydy . 


(b)  dj<  >  d(J)  —  s(Vi  \  I)  for  some  j'  £  J. 

We  consider  a  solution  to  system  3  with  any  feasible  J  such  that  the  right  hand  side  of  in¬ 
equality  Eiei  jeJ-(ij)eExij  =  d(J  \  J)  —  s(Vi  \  I)  is  nonnegative  and  market  j'  is  satisfied.  In 
the  solution  we  can  set  Thiel  jeJ-U  j)eExij  =  y)£E  xij'  ■  This  is  a  feasible  solution  since 

dy  >  d{J)  —  s(Vi  \  I)  by  assumption  and  we  know  that  for  inequality  (15)  to  be  tight  we  cannot 
have  T,ieijeJ-.(i,j)eExij  >  d{J )  -  s(Ci  \  I).  Hence,  J2iei,jeJ:(i,j)eExij  <  d{J)  -  s(Vi  \  I)  and  we 
can  choose  a  solution  in  which  a  part  (or  all)  of  the  demand  of  market  j1  is  met  by  suppliers  in  set 
I.  We  use  xpj  =  adj  +  Ujdj  for  all  j  £  JK  and  recall  that  markets  in  set  J  C  J<  are  rejected.  We 
evaluate  (46)  at  this  solution  and  obtain 

a(d(  J  \  J)  —  s(Vi  \  J)  +  d(  J))  +  Uj'  x^' +  ujxij 

ieVi\I,jeJ\j:(i,j)eE 

+  Ujdj  =  a. 

jeJuv2\J 
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Then  we  use  the  same  solution  except  now  we  set  zy  =  1,  zq  =  0,  q  G  j  (this  is  still  a  feasible 
solution  since  s(Vi)  >  s(Vj  \  I)  >  d(J)  —  dy  by  assumption,  i.e.,  once  market  j'  is  rejected  all  other 
markets  in  set  J  can  be  satisfied)  and  x*j  ~  0  (implying  that  xn’  =  °) 

and  reevaluate  (46).  Note  that  this  solution  is  also  on  the  face  defined  by  (15)  because  we  had 
zy  =  0,  zq  =  1,  q  G  J,  Yjiei,jeJ-.(i,j)eExij  =  d(J  \  J)  -  s(V i  \  I)  and  we  changed  it  with  zy  =  1, 
zq  =  0,  q  G  J,  J2iei  jeJ-(i  j)&E  Xii  ~  0  and  the  coefficient  of  zy  is  d(  J)  —  s(Vi  \  /).  Thus, 

d(0)  +  0  +  UjXij  +  ujdj  +  Vy  =  d. 

iev1\i,jeJ\{j’}-.(i,j)eE  jev2\j 

Taking  the  difference  between  (46)  evaluated  at  these  two  solutions,  we  get  a(d(J)  —  s(Vi  \  /))  + 
uj'  Yliev1:(i,j')eE  xij'  ~  '^iev1,jeJujXii  +  J2j^jujdj  -  i>y  =  0-  Because  EievyyjqeE  xij  =  dy 
and  T.ieVujeJ  UJXH  =  T,jej  uidj  we  have  =  a{d{J)  -  s(V 1  \  /))  +  uydy. 

6-  &  =  a  ( d{J )  -  s(Vj  \  /))  +  J2jev2  uJdr 
Rewriting  equality  (45),  we  get 

d  ( Yliei,jeJ-.(i,j)eE  xn  +  EjeJminMJ)  -  s(Ri  \  I),dj}  z^  (47) 

+  E( i,j)eE-.jev2  ujxij  +  J2jev2  uidiz3  =  d- 

Evaluating  (47)  at  any  point  ( x ,  z)  feasible  to  TPMC  that  is  tight  at  inequality  (15)  gives 

d  (d(J)  -  s(F l  \  I))  +  uo  Xii  +  dizi  = 

jSV 2  \ieVi:(i,j)eE  J 

From  equality  (lb)  in  the  definition  of  TPMC  we  have  J2iev1:(i,j)eExij  +  djzj  =  dj  for  all  j  G  14- 
Thus,  a  =  a  (d(  J)  -  s(Vj  \  /))  +  J2jev2  uidj- 

□ 

Our  next  result  shows  that  the  coefficient  update  scheme  in  Proposition  6  is  neither  lifting  nor  coefficient 
strengthening.  We  show  that  both  a  type-I  base  inequality  (16)  and  the  corresponding  inequality  (19)  can 
be  facets  of  71  under  certain  conditions. 

Proposition  8.  If  the  following  conditions  hold ,  then  type-I  base  inequality  (16)  and  the  corresponding 
inequality  (19)  are  facets  of  T\. 

1.  If  there  exists  j  G  J*  \  J  with  (dj  <  po  then  f3j  —  P  <  P0  and  P(J  \  {jP,jP- 1})  +  Pj  <  Po  where 
J  =  {ji,32,  ■  ■  ■  ,jP}  and  ph  <fij2<---<  pjp. 

2.  For  all  j  G  J  \  J* ,  pj  <  p0  and  (3(J  \  {jp})  +  Pj  <  (30. 

Proof.  We  first  show  that  there  exists  dim(fT\)  =  |J|  +  1  many  affinely  independent  points  that  satisfy 
inequality  (19)  at  equality.  Consider  the  following  points: 

•  Let  t  =  0,  z.j  =  1  for  all  j  G  J,  Zj  =  0  for  all  j  G  J\J.  In  this  case,  the  left-hand  side  of  inequality  (19) 
is  p( J)  -pP  =  p0  + p -pP  =  p0- (p-l)P  =  Po- 

•  For  each  f  G  J,t  =  py~P,  zy  =  0,  Zj  =  1  for  all  j  G  J\{/},  z:)  =  0  for  all  j  G  J\J.  In  this  case,  the  left- 
hand  side  of  inequality  (19)  is  Py—  p+p(J\{j'})—  (jp—  l)P  =  p{J)—pp  =  P0+P—pP  =  Po—{p—\)p  =  p'0. 
This  point  also  satisfies  type-I  base  inequality  (16)  at  equality. 

•  For  each  j'  G  J*  \  J  we  consider  two  cases: 
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1.  /3j>  =  fa. 

Let  t  =  0,  zy  =  1,  Zj  =  0  for  all  j  G  J  \  {/}.  The  left-hand  side  of  inequality  (19)  is  min{(/3y/  — 
P),Po)  =  min{(/30  —  P),Po  —  {p  —  l)/3}  =  Po  —  {p  —  1  )P  since  p  is  the  number  of  elements  in  set 
J  and  p  >  2,  for  J  to  be  a  minimal  cover.  This  point  also  satisfies  type-I  base  inequality  (16)  at 
equality. 

2.  py  <  po- 

Let  t  =  Po-P(J\{jp,jp-i})~Pj zj  =  1,  for  all  j  G  J\{jP,jP- 1},  zjp  =  0,  zjp_1  =  0,  zy  =  1,  Zj  =  0 
for  all  J\(  JU{/}).  From  facet  condition  1  we  have  by  —p  <  P0  hence  the  left-hand  side  of  inequality 
(19)  is  p0-p{J\{jp,  jp-i})~Pj'+P(J\{jp,  jp-i})-(p-2)p  +  Py -P  =  p0  —  (p—l)p  =  p'0.  Note 
that  due  to  facet  condition  1,  t  >  0.  Furthermore,  this  point  also  satisfies  type-I  base  inequality 
(16)  at  equality. 

•  For  each  j'  G  J  \  J*  first  observe  that  pj'  <  Po  since  by  definition  of  J* ,  /3y  <  pjp  <  po ■  Let 
t  =  Po~  P{J  \  {jP})  -  Pj’,  Zj  =  1,  for  all  j  G  J\  {jp},  zjp  =  0,  zy  =  1,  Zj  =  0  for  all  J  \  ( J  U  {/}). 
From  facet  condition  2  we  have  pj /  <  P0  hence  the  left-hand  side  of  inequality  (19)  is  P0  —  P(J\{jP})  — 
Pj '  +  p(J  \  {jp})  —  (p  —  1  )p  +  Pj'  ==  Po  —  (p  —  1  )P  =  Po-  Note  that  due  to  facet  condition  2,  t  >  0. 
Furthermore,  this  point  also  satisfies  type-I  base  inequality  (16)  at  equality. 

In  total  we  have  described  1  +  |  J|  +  |  J*  \  J\  +  |  J  \  J*\  =  \  J\  +  1  many  points.  It  is  easy  to  see  that  these 
points  are  affinely  independent.  Furthermore,  except  for  the  first  described  point  ( t  =  0,  Zj  =  1  for  all  j  G  J, 
Zj  =  0  for  all  j  G  J\J)  all  the  other  |  J\  many  points  also  satisfy  type-I  base  inequality  (16)  at  equality.  If 
we  replace  the  first  point  with  the  point  t  =  Po,  Zj  =  0  for  all  j  G  J,  which  satisfies  the  type-I  base  inequality 
at  equality,  then  we  still  get  |  J|  + 1  many  affinely  independent  points.  Hence,  both  the  type-I  base  inequality 
(16)  and  the  corresponding  inequality  (19)  are  facets  of  71  under  conditions  1  and  2.  □ 

Suppose  that  inequality  Yljejtj  —  ao  is  given  as  a  type-II  base  inequality  in  the  form  of  (24)  for 
set  <S2,  where  ay  =  0  for  all  j  G  J.  Assume  that  there  exists  J  and  m  such  that  a0  >  d(J  \  J)  and 
ao  —  d(J\  J)  <  maXjej{dj}.  These  conditions  imply  that  m  =  0  and  a  =  ao  —  d(J\  J).  Then  we  obtain  the 
corresponding  inequality  (27) 


/L 1  i  +  12  azj  <  ao  +  (|  J\  —  1  )a,  (48) 

JtJ  jeJ 

which  is  valid  for  under  these  assumptions. 

Proposition  9.  Inequality  (48),  valid  for  52,  defines  a  facet  o/72  only  if 

1.  J^0. 

In  addition,  if  the  following  conditions  hold  then  (48)  is  a  facet  o/72- 

2.  a0  <  d(J  \J)  +  min^.g  j{dj}, 

3.  ao  <  d{J  \J)  +  maxjeJ{dj}  -  j 

4-  \J\J\>2. 

Proof.  Necessity. 

1.  Assume  that  J  =  0.  Then  inequality  (48)  reduces  to 

12  Aj  <  on  —  ol.  (49) 

j£J 
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This  case  implies  that  a  =  ao  —  d(J\J)  =  ao  —  d(J).  Thus,  inequality  (49)  becomes  ejtj  <  d(J)  which 
is  dominated  by  tj  +  djZj  <  dj  for  all  j  £  J. 

Sufficiency.  We  show  that  there  exists  dimfiTf)  =  2|«7|  many  affinely  independent  points  that  satisfy 
inequality  (48)  at  equality.  Let  e  >  0  be  a  very  small  number  and  j*  =  arg max.g  j{dj}  ( j *  exists  due  to 
facet  condition  1).  Consider  the  following  points: 

•  For  each  j'  e  J ,  let  zy  =  0,  ty  =  a0-d{J\J),  Zj  =  1,  j  £  J\{j'},  tj  =  0,  j  £  J\{j'},  Zj  =  0,  j  £  J\  J, 
tj  =  dj,  j  £  J\J.  Note  that  this  is  a  feasible  solution  due  to  the  assumption  that  «o  >  d(J\J)  and 
facet  condition  2.  Furthermore,  for  each  such  point  we  construct  another  point  by  increasing  ty  by  e 
and  decreasing  any  tj,  j  £  J  \  J  by  e  (j  exists  due  to  facet  condition  4).  This  gives  2|  J|  many  points. 

•  For  each  j"  £  J\J,  let  zy<  =  1,  ty  =  0  ,_zj  =  0,  j  £  ( J  \  ( J  U  {/'}))  U  {j*},  tj  =  dj,  j  £  J  \  (J  U  {j"}), 
tj*  =  «o  —  d(J  \J)  +  dj ",  Zj  =  1,  j  £  J  \  {j*},  tj  =  0,  j  £  J  \  {j*}.  Note  that  this  is  feasible  due  to 
facet  condition  3.  For  each  such  point  we  construct  another  point  by  increasing  tj *  by  e  and  decreasing 
any  tj,  j  £  J\(J  U  {/'})  by  e  (j  exists  due  to  facet  condition  4).  This  gives  2|  J  \  J\  many  points. 

It  is  easy  to  see  that  these  points  are  affinely  independent. 

□ 

Now,  suppose  that  we  start  with  a  type-II  base  inequality  J2iei  jeJ-(i  j)eE  xij  —  s(-0  'n  Proposition  7. 
Note  that  inequality  jeJ-(i  j)eE  xij  —  s(-0  a  relaxation  of  the  supply  constraints  (lc).  Let  tj  = 

j)eE  xij  and  aj  =  0  for  all  j  £  J  in  inequality  (24).  Suppose  that  there  exists  J  and  m  such  that 

s(I)  >  d(J\J)  and  s(I)  —  d(J\J )  <  rna x-ej{dj}.  These  conditions  imply  that  m  =  0  and  a  =  s(I)  —  d(J\J). 
Then  we  obtain  the  inequality 


Xii  +  Ylazi  ~  +  “  1)a’ 

i£l,j£J:(i,j)eE  j£j 


(50) 


which  is  valid  for  X . 

Proposition  10.  Inequality  (50),  valid  for  X,  defines  a  facet  of  conv(X)  only  if 

1.  J  0. 

In  addition,  if  the  following  conditions  hold  then  (50)  is  a  facet  of  conv(X): 

2.  s(Pi)  >  d(J  \J)+  ma xje(V2\J)uj{dj}, 

3.  s(I)  <  d(J  \J)  +  mmjeJ{dj}, 

4-  8(1)  <  d(J  \  J)  +  ma xjeJ{dj}  -  max^^jjdj}. 

Proof.  Necessity. 

1.  If  we  replace  tj  by  Y^iei-U  j)eE  xij  f°r  &H  3  ^  J  ancl  ao  by  s(-0  we  can  use  the  same  argument  as  in 
the  necessity  of  facet  condition  1  in  Proposition  9. 

Sufficiency.  For  the  proof  we  use  §/.4.3  Theorem  3.6  [19].  We  show  that  inequality  (50),  plus  any  linear 
combination  of  the  demand  constraints  j)eE  xij  +  djZj  =  dj  for  all  j  £  is  the  only  inequality  that 

is  satisfied  at  equality  by  all  points  (x,z)  feasible  to  TPMC  that  are  tight  at  (50),  i.e.,  we  show  that  if  all 
points  of  TPMC  at  which  (50)  is  tight  satisfy 

'y ^jXij  +  y  ]  MjZj  =  a,  (51) 

(i,j)£E  j&V2 


then 
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L  A ij  =  Uj,  j  G  C2  \  J,  i  £  Cl,  ( i,j )  G  E, 

2.  A  ^  =  uj,j  €  J,ieV l  \  /,  (*,  j)  G  E, 

3.  Ay  =  A  +  Uj,  j  £  J,  i  £  I,  (i,j)  £  E , 

4.  Wj  =  Ujdj,  j  £V2\  J, 

5.  Wj  =  Aa  +  Ujdj,  j  G  J, 

6.  A  =  A  (s(I)  +  (|  J|  -  l)a)  +  £ieV-2  ujdj. 

In  the  proof  we  consider  four  different  types  of  points  at  which  (50)  is  tight  that  make  use  of  the  facet 
conditions.  Throughout,  let  e  be  a  very  small  number  greater  than  zero  unless  noted  otherwise. 

1.  Consider  a  point  where  only  markets  j  G  J  \  J  U  {r}  are  satisfied  for  some  r  £  V2  \  J,  and  constraints 


y %ij  —  d^j  \  j ) 

x  i^  —  dj‘ 

i£Vi:(i,r)£E 

Xij  0, 

Xij  >  e, 

Xir  ^  Ef 

^  '  'Gy  '  -  S  i  e, 


j£V2:(i,j)£E 


Zj  =  1, 

=  0, 


i  G  Ci,  j  G  J  U  V2  \  (J  U  {r}) 

i£l,j£j\J 

i  G  Cl 
i  G  Ci 

JG  JUC2\(JU{r}) 
j  G{r}UJ\J 


in  addition  to  the  original  constraints  are  satisfied,  which  we  refer  to  as  System  1.  We  know  that  a 
solution  to  System  1  exists  from  assumption  s(/)  >  d(  J  \  J)  and  facet  condition  2. 

2.  Consider  a  point  where  only  markets  j  G  J  \  J  are  satisfied,  and  constraints 

y  i  xij  =  d(j  \  j) 

i£l,j£J:(i,j)£E 

Xij  =0,  i  G  Cl,  j  G  JU  C2  \  J 

i  &  I,j  £  J  \  J 
i  £  I 

j  £  J  U  Ct  \  J 
j  £  J  \  J 

in  addition  to  the  original  constraints  are  satisfied,  which  we  refer  to  as  System  2.  We  know  that  a 
solution  to  System  2  exists  from  assumption  s(/ )  >  d{J  \  J). 


Xij  '  C, 

y  \  Xij  '  '■  €, 

jev2-.(i,j)eE 

Zj  =  1, 
=  0, 
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3.  Consider  a  point  where  only  markets  j  €  J  \  J  U  {/}  are  satisfied  for  some  l  €  J,  and  constraints 


E  Xi3  = 

iei,jeJ-.(i,j)eE 

E 

Xij  =  d(J\J)  +  di-  s(I) 

)€E 

cT 

II 

i£V1,j£j\{l}UV2\J 

Xij  >  e, 

i  £  Vi ,j  £  J  \  J  U  {/} 

^  (  Xij  ^  Si  6, 

jev2-.{i,j)eE 

*  €  Vi  \  / 

Zj  =  1, 

j  £  J\{l}UV2\J 

cT 

II 

j  £  J  \  J  U  {?} 

in  addition  to  the  original  constraints  are  satisfied,  which  we  refer  to  as  System  3.  We  know  that  a 
solution  to  System  3  exists  from  facet  conditions  2  and  3. 

4.  Consider  a  point  where  only  markets  j  £  J\(J  U  {j'})  U  {l*}  are  satisfied  for  l*  =  argmax-e  j{dj}  and 
some  j'  £  J  \  J,  and  constraints 


s(I) 

s(I ) 

=  0,  i£V1,j£{j'}UJ\{l*}UV2\J 

=  1,  j  £{f}UJ\{l*}UV2\J 

=  0,  j  £  J\(JU  {/})  U  {l*} 

in  addition  to  the  original  constraints  are  satisfied,  which  we  refer  to  as  System  4.  We  know  that  a  solution 
to  system  4  exists  from  facet  conditions  2  and  4. 

1.  A ij  =  Uj,  j  £  V2  \  J ,  *  £  Vi,  (i,  j)  £  E. 

Consider  any  solution  to  system  1  with  any  market  j  =  r  £  V2  \  J  that  is  satisfied.  Choose  arbitrary 
suppliers  i,i'  £  Vi  such  that  ( i',j )  £  E.  Construct  a  new  point  by  decreasing  the  flow  on  edge 

(i,j)  by  e  and  increasing  the  flow  on  edge  by  e.  Note  that  this  point  is  also  on  the  face  defined 

by  inequality  (50).  Thus, 

\j  =  Uj,j  £V2\J,i£  Vi,  (i,j)  £  E. 

2.  A  ^  =  uj,  j  £  J,  i  £  Vi\I,  (■ i,j )  £  E. 

Consider  any  solution  to  system  3  with  market  j  £  satisfied  for  some  l  £  J.  Choose  arbitrary 

suppliers  i,i'  £  V\  \I  such  that  (i,j),  (i',j)  G  E.  Construct  a  new  point  by  decreasing  the  flow  on  edge 
(i,j)  by  e  and  increasing  the  flow  on  edge  (i',j)  by  e.  Note  that  this  point  is  also  on  the  face  defined 
by  inequality  (50)  since  i,  i’  £  V\  \  I.  Thus, 

A  ij  =Uj,j  £  J\JU{l},i  €  Vi  \  /,  (i,j)  £  E. 

Note  that  since  we  can  use  the  above  argument  for  any  l  £  J,  we  have  Xu  =  iq  for  all  l  £  J,  i  £  Vi  \  I, 
(■ i,l )  £  E. 


E 


E 

i€V!\I,j€J:(i,j)€E 


i€l,j€J:(i,j)€E 

Xij  =  d(J  \  J)  +  di *  —  dj’  — 


Xi 
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3.  A ij  —  A  +  Uj ,  j  £  <7,  i  £  /,  (f,  j)  €  77. 

Consider  any  solution  to  system  2.  Choose  arbitrary  suppliers  i,i'  £  I  such  that  ( i,j ),  (i',  j)  €  E  for 
j  £  J  \  J.  Construct  a  new  point  by  decreasing  the  flow  on  edge  (i,  j)  by  e  and  increasing  the  flow  on 
edge  (i1 ,  j)  by  e.  Note  that  this  point  is  also  on  the  face  defined  by  inequality  (50).  Thus, 

A  ij  =  Xj,  j  £  J  \J,i  £  I,  (i,j)  £  E. 

Next  we  consider  a  solution  to  system  3  with  e  =  0.  Choose  arbitrary  suppliers  i ,  i'  £  I  and  market 
j  £  J\J  such  that  (i,  j),  ( i,l ),  (i' ,  l )  €  E.  Construct  a  new  point  by  decreasing  the  flow  on  edges 

( i',l )  by  e  and  increasing  the  flow  on  edges  ( i,l )  by  e.  Note  that  this  point  is  also  on  the 

face  defined  by  inequality  (50).  Thus, 

— A  ij  +  Xu  +  A  uj  —  Xui  =  —A  j  +  Xu  +  Aj  —  A  m  =  Xu  —  X  ui  =  0. 

Because  l  is  any  market  in  set  J,  A  ij  =  A  j,  j  £  J,  i  £  I,  ( i,j )  €  E. 

Let  A]  =  Xj+Uj,  j  £  J.  Next  we  show  that  A  j  =  A,  j  £  J.  We  consider  a  solution  to  system  3  with  e  =  0. 
Choose  any  markets  j,j'  £  J,  any  suppliers  i  £  V\\I,  i'  £  I  such  that  (*,,j  0  G  E. 

Decrease  flow  on  edges  ( i,j '),  ( i',j )  by  e  and  increase  flow  on  edges  (*,  j),  (i' ,j')  by  e.  Thus, 

A  ij  —  Xij i  —  X  uj  +  A  uj'  =  Uj  —  Uj i  —  Xj  +  A  j,  =  0. 

By  again  using  A]  =  Xj  +  Uj  and  A],  =  Ay  +  tty,  we  obtain 


4.  ujj  =  Ujdj,  j  £V 2  \  J. 

We  rewrite  (51)  using  the  information  obtained  until  now,  and  get 

A  ^2  t LjXij  +  u)jZj  =  X.  (52) 

iei,jeJ-.(i,j)eE  (i,j)£E  i&v. 2 

Consider  any  solution  to  system  1  with  market  r  £  V%  \  J  that  is  satisfied.  Then  we  construct  a  new 
solution  based  on  this  solution  where  we  set  zr  =  1  and  Xir  =  0  for  all  i  £  V±,  (i,r)  £  E  and  all  other 
variables  remain  the  same.  This  is  a  solution  to  System  2.  Thus  this  solution  is  also  on  the  face  defined 
by  (50).  We  compare  inequality  (51)  evaluated  at  these  two  solutions.  Thus, 

u ^^2  Xiy  ay  —  0. 
iEV\:(i,r)EE 

Because  J2ieVl-.(i  r)eE  x^r  =  we  have  ur  =  urdr,  r  €  V2  \  J. 

Next  we  show  that  coj  =  Ujdj,  j  £  J\  J.  First  we  consider  a  solution  to  system  3  where  we  choose 
l  =  l*  =  argmaxj6j{<ij}.  This  is  a  feasible  choice  due  to  facet  condition  2.  We  evaluate  (52)  at  this 
solution,  and  get 

X(s(I))  +  UjXij  +  (53) 

ieV1,jeJ\Ju{l*}:(i,j)EE  jeV2\JUJ\{l*} 

Next  we  consider  a  solution  to  system  4  where  some  market  j'  £  J  \  J  is  rejected.  We  evaluate  (52)  at 
this  solution,  and  obtain 

+  ^2  UjX^  +  ^2  Uj  +  Wj>  =  A.  (54) 

iev1,jeJ\(Ju{j'})u{f}-.(i,j)eE  jev2\JuJ\{i *} 

We  subtract  (54)  from  (53)  and  obtain  Uj /  j')ee  xij'  ~  wi'  =  0-  Because  X^evWi  j')eE  xij'  =  dj> 

we  have  u>y  =  Uj>dj>,  f  £  J  \  J. 
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(55) 


5.  utj  =  Aa  +  ujdj,  j  £  J. 

Consider  any  solution  to  system  3  with  any  market  l  £  J  that  is  satisfied.  Then  (51)  reduces  to 

A  (s(/))  +  55  ujxij  +  55  uj  =  A. 

iev1,jeJ\Ju{iy.(i,j)eE  jev2\JuJ\{i} 

We  also  consider  a  solution  to  system  2  where  market  l  £  J  is  rejected.  Then  (51)  reduces  to 

A(d(J\J)+  55  ujxij  +  55  =  A-  (56) 

ieVltjeJ\J:(i,j)£E  j£V2\JUJ 

We  subtract  (56)  from  (55)  and  obtain,  A (s(/)  —  d(  J  \  J))  +  u;  5^»evi-(i  l)^E  Xil  ~  w/  =  0.  Since  s(/)  — 
d(  J  \J)  =  a  and  5Zievi-(i  ;)<=£  Xil  =  ^ 1  we  conclude  that  w;  =  Aa  +  for  l  £  J. 

6.  A  =  A  (5(7)  +  (|  J|  -  l)a)  +  Y,j£v2  uodr 

We  rewrite  (51),  and  get 


A 


E 


E  aZ3  +  E  UiXV  +  E  U3d3Z3  =  'V 


(57) 


yi£l,j£j:(i,j)£E  )  ( i,j)£E  j£V2 

Evaluating  (57)  at  any  point  (a:,  2)  feasible  to  TPMC  that  is  tight  at  inequality  (50)  gives 


A  (s(/)  +  (|  J|  -  l)a)  +  55  u3  (  E 


xij  +  djZj  I  =  A. 


j£V2  \i£Vi:(i,j)£E 


From  the  definition  of  TPMC  we  have  'Yhiev1-(i  j)eExij  +  djZj  =  dj  for  all  j  £  V2.  Thus,  A  = 
A  ^s(I)  +  (|  J|  —  l)a^  +  Yhj£v2  uJdJ- 

□ 


Even  though  Propositions  6  and  7  are  general  results  for  mixed-integer  cover  and  knapsack  sets  <S>i  and 
S‘2 ,  we  observed  that  many  of  the  facets  for  TPMC  can  be  derived  from  the  recursive  application  of  these 
results. 

Example  4.  (Continued.)  Observe  that  inequalities  (28),  (29)  and  (30)  satisfy  all  the  conditions  given  in 
Proposition  8  and  inequality  (32)  satisfies  all  the  conditions  given  in  Proposition  10,  and  hence  they  are  facets 
of  conv(X ). 

Finally,  while  the  blossom  inequalities  (3)  are  strong  for  the  case  that  dj  <  2  for  all  j  £  V 2,  they  are  not 
facet-defining  for  the  general  case  of  TPMC  based  on  our  experience  with  PORTA  [7]. 


5  Computational  Results 

In  this  section  we  present  our  computational  results  for  the  TPMC  problem.  We  conduct  the  experiments 
on  an  Intel  Xeon  x5650  Processor  at  2.67GHz  with  4GB  RAM.  We  use  IBM  ILOG  CPLEX  12.4  as  the  MIP 
solver.  We  test  the  TPMC  problem  for  various  settings  of  V\  and  V2.  There  are  12  combinations  of  V\  and 
V2  as  shown  in  Tables  1  and  2,  in  the  first  column.  For  each  combination,  we  create  3  instances  and  report 
the  averages.  We  observed  that  most  instances  of  the  TPMC  problem  are  solved  under  a  minute  for  each 
setting  of  V\  and  V2.  Therefore,  we  found  “hard”  instances  by  continually  generating  and  solving  instances 
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until  we  were  able  to  find  3  that  were  solved  in  at  least  15  minutes  under  default  CPLEX  settings.  Problem 
parameters  are  generated  using  a  discrete  uniform  distribution  with  supply  values  s,;  G  [10,  20],  demand  values 
dj  G  [10,20],  weights  uy?-  G  [20,50]  and  lost  revenues  r:j  G  [5000,6000].  In  our  computations,  we  impose  a 
time  limit  of  half  an  hour,  and  consider  the  following  four  algorithms: 

(1)  BB  (Branch  and  Bound):  TPMC  formulation,  (la)-(le)  with  no  cuts, 

(2)  UC  (User  Cuts):  TPMC  formulation,  (la)-(le)  with  only  user  cuts, 

(3)  CD  (CPLEX  Default  Settings):  TPMC  formulation,  (la)-(le)  with  default  CPLEX  cuts, 

(4)  UCD  (User  Cuts  and  CPLEX  Default  Settings):  TPMC  formulation,  (la)-(le)  with  user  cuts  and  default 

CPLEX  cuts. 


Table  1:  Comparison  of  Algorithms  BB  and  UC 


IGI,IL2l 

RGap 

RCuts 

EGap 

ECuts 

Time  (unslvd) 

B&C  Nodes 

BB 

UC 

BB  UC 

BB 

UC 

BB  UC 

BB 

UC 

BB 

UC 

200,230 

86.7% 

86.7% 

u8 

0.8% 

1.0% 

ul89 

1800(3) 

1800(3) 

176305.3 

107412 

200,240 

1.7% 

1.7% 

u4.3 

0.7% 

0.8% 

ul86 

1800(3) 

1800(3) 

168384.7 

91004.7 

200,250 

28.1% 

28.1% 

u4.7 

0.2% 

0.2% 

ul03.3 

1800(3) 

1800(3) 

141624.3 

89966.7 

300,330 

54.3% 

54.3% 

u4 

0.5% 

0.5% 

ul25.7 

1800(3) 

1800(3) 

75466.3 

56085.3 

300,340 

1.6% 

1.6% 

u6 

0.9% 

0.9% 

ul23.3 

1800(3) 

1800(3) 

60125.7 

50122.7 

300,350 

58% 

29.5% 

u4.3 

0.4% 

0.4% 

u98.3 

1800(3) 

1800(3) 

51141.3 

44166.7 

400,430 

0.8% 

0.7% 

u4 

0.4% 

0.5% 

u74.7 

1800(3) 

1800(3) 

29565 

32466.7 

400,440 

81% 

54.3% 

u4.7 

0.3% 

0.3% 

u72.7 

1800(3) 

1800(3) 

23177.7 

26600 

400,450 

0.4% 

0.4% 

u4.7 

0.2% 

0.2% 

u57.7 

1800(3) 

1800(3) 

22593 

23733.3 

500,530 

83.2% 

83.2% 

u7 

0.5% 

0.2% 

u52 

1800(3) 

1690.5(2) 

18152.3 

16380 

500,540 

81.4% 

81.4% 

u4 

0.4% 

0.4% 

u61.3 

1800(3) 

1800(3) 

16115 

17573.3 

500,550 

0.6% 

0.6% 

u6.3 

0.4% 

0.4% 

u21.7 

1800(3) 

1800(3) 

14767.7 

18100 

Average 

43.5% 

35.2% 

u5.2 

0.5% 

0.5% 

u96.4 

1800(3) 

1790.9(2.9) 

66451.5 

47801 

Table  2:  Comparison  of  Algorithms  CD  and  UCD 


|VT|,|Vb| 

RGap 

RCuts 

EGap 

ECuts 

Time  (unslvd) 

B&C  Nodes 

CD 

UCD 

CD 

UCD 

CD 

UCD 

CD 

UCD 

CD 

UCD 

CD 

UCD 

200,230 

58.4% 

58.1% 

10.7 

6,u2.3 

0.2% 

0.2% 

568.7 

569,u37.3 

1342.3(1) 

1249.8(1) 

64767 

57478.7 

200,240 

1.6% 

1.6% 

10 

9.3,u2.7 

0.4% 

0.6% 

307.3 

219.7,ul01.7 

1420.9(2) 

1333.9(2) 

93360.7 

74344.3 

200,250 

28.1% 

1% 

7.3 

4,u3.3 

0.1% 

0.1% 

573.3 

412,ul7.3 

1265.4(1) 

815.6(1) 

53962.7 

39750 

300,330 

0.8% 

0.7% 

12 

8,u2 

0.3% 

0.3% 

164.7 

178.3, u50.3 

1800(3) 

1227.1(2) 

72057.7 

33551.3 

300,340 

1.6% 

1.5% 

13.3 

7.3,  ul.  3 

0.2% 

0% 

334.7 

239,ul6.3 

1678.3(1) 

1067.6(1) 

49950 

31914.3 

300,350 

29.5% 

29.5% 

4.7 

5.7, u2 

0.2% 

0.2% 

161 

139.3, u53.3 

1025.9(1) 

901.7(1) 

29653.7 

23473.7 

400,430 

0.7% 

0.7% 

10.7 

8.3,u2.7 

0.2% 

0.3% 

114.7 

105.7,u41 

1800(3) 

1234.9(2) 

34852 

21395.7 

400,440 

27.5% 

27.8% 

12 

8,u3.3 

0.2% 

0.1% 

128.7 

167.7, u25 

1800(3) 

1216.6(2) 

24729.3 

16442.7 

400,450 

0.4% 

0.4% 

6 

6.3,u3.7 

0.1% 

0.1% 

133 

173.3, u23 

1800(3) 

1800(3) 

19398.7 

22166.7 

500,530 

42.3% 

42.3% 

7.7 

5.3, u3 

0.2% 

0.2% 

76.3 

86.3, u55 

1800(3) 

1661.7(2) 

34852 

21089.7 

500,540 

27.9% 

27.9% 

7 

8.7, ul 

0.4% 

0.2% 

58.3 

151.3,u32.7 

1800(3) 

1800(3) 

20709 

18757.3 

500,550 

0.6% 

0.6% 

7 

8.7,u3.7 

0.4% 

0.3% 

26.3 

68,u34.7 

1800(3) 

1800(3) 

17482 

18707.3 

Average 

18.3% 

16% 

9 

7.1, u2. 6 

0.2% 

0.2% 

220.6 

209.1,u40.6 

1611.1(2.3) 

1342.4(1.9) 

42981.2 

31589.3 

In  Tables  1  and  2,  column  RGap  reports  the  average  percentage  integrality  gap  at  the  root  node  just 
before  branching,  which  is  100  x  ( zub —  zrb)/ zub ,  where  zub  is  the  objective  function  value  of  the  best  integer 
solution  obtained  within  time  limit  and  zrb  is  the  best  lower  bound  obtained  at  the  root  node.  Column 
RCuts  reports  the  average  number  of  cuts  added  at  the  root  node.  In  column  EGap,  we  report  the  average 
percentage  end  gap  at  termination  output  by  CPLEX,  which  is  100  x  ( zub  —  zbest)  / zub,  where  zbest  is  the 
best  lower  bound  available  within  time  limit.  Column  ECuts  reports  the  average  number  of  cuts  added 
after  the  problem  is  solved  to  optimality  within  the  time  limit.  Column  Time  (unslvd)  reports  the  average 
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solution  time  in  seconds  and  the  number  of  unsolved  instances  in  parentheses  in  cases  where  not  all  three 
instances  are  solved  to  optimality  within  time  limit.  We  denote  the  user  cuts  by  u  and  for  the  other  cuts, 
i.e. ,  cuts  added  by  CPLEX  we  do  not  use  a  prefix.  In  column  B&zC  Nodes  we  report  the  average  number 
of  branch-and-cut  tree  nodes  explored.  At  the  end  of  Tables  1  and  2  we  give  the  averages  of  RGap,  RCuts, 
EGap,  ECuts,  Time  (unslvd)  and  B&C  Nodes,  respectively.  For  each  value  in  the  tables  we  report  the 
numbers  rounded  to  the  first  decimal  place. 

User  cuts  are  generated  every  10000  B&C  nodes.  For  the  variable  upper  bound  inequalities  (14)  we  add 
a  violated  inequality  if  Sj  <  dj,  i  £  Vi,  j  £  V2,  ( i,j )  £  E  and  Xij  >  S;(l  —  Zj).  Recall  that  inequalities  (19) 
are  related  to  the  weight  inequalities  for  0/1  knapsack  problems.  The  exact  separation  of  weight  inequalities 
involves  solving  0/1  knapsack  problems.  Weismantel,  Kaparis  and  Letchford  give  exact  pseudo-polynomial 
separation  algorithms  for  weight  inequalities  [15,  26].  The  optimization  problems  for  finding  the  most  violated 
inequalities  (15)  and  (27)  involve  nonlinear  objectives  and  constraints  that  resemble  knapsack  constraints. 
Thus,  we  use  a  heuristic  separation  for  inequalities  (15),  (19)  and  (27).  Let  ( x,z )  be  a  fractional  point.  The 
heuristic  for  finding  a  violated  inequality  (15)  takes  (. x,z )  and  selects  sets  /  and  J  simultaneously.  Set  J 
includes  a  market  with  fractional  z  value,  and  other  markets  that  receive  demand  from  the  same  suppliers  as 
the  market  with  fractional  z.  All  the  suppliers  that  do  not  send  demand  to  markets  in  set  J  are  placed  in  set 
I.  More  details  for  this  heuristic  can  be  found  in  Algorithm  2.  The  heuristic  for  finding  a  violated  inequality 
(19)  uses  the  type-I  base  inequalities  (15),  and  adds  the  smallest  p  coefficients  of  the  2  variables  that  exceed 
the  right-hand  side,  /3q  to  obtain  the  cover  J.  For  all  the  instances  in  Tables  1  and  2  the  violated  inequality 
(15)  (i.e.  type-I  base  inequality)  found  by  the  heuristic  separation  has  the  coefficients  of  all  the  z  variables 
equal  to  the  right-hand  side,  /3o-  It  is  easy  to  see  that  if  at  least  two  coefficients  of  2  variables  are  not  strictly 
less  than  the  right-hand  side,  flo  hr  a  given  type-I  base  inequality,  the  new  inequality  of  type  (19)  cannot  be 
a  facet  of  conv(X).  Therefore,  for  the  given  instances  no  violated  inequality  of  type  (19)  is  generated.  Note 
that  our  separation  heuristic  for  inequality  (19)  is  different  than  that  of  [14,  15,  26]  because  our  choice  of  set 
J  also  impacts  the  continuous  term  t  =  Thiel  j)eExiji  which  is  not  present  in  their  setting.  We  have 
three  heuristics  for  finding  a  violated  inequality  (27).  Two  of  them  uses  the  supply  constraints  as  a  base 
inequality  for  a  certain  choice  of  J  (i.e.  TjeJ-(i  j)eE  xij  —  s*  f°r  *  €  Vl  and  J  C  V2),  one  of  which  finds  an 
inequality  with  |  J\  =  1  and  the  other  finds  an  inequality  with  |  J\  =  |J|  —  1.  The  details  for  these  heuristics 
are  given  by  Algorithms  3  and  4,  respectively.  The  third  heuristic  uses  Thiev-i  jev2-(i  j)eE  xij  —  s(hi)  as  a 
base  inequality  and  finds  a  violated  inequality  with  J  that  includes  the  rejected  markets  and  markets  that 
have  fractional  z  values.  More  details  on  this  heuristic  is  given  in  Algorithm  5. 

Table  1  compares  the  performance  of  algorithms  BB  and  UC,  to  isolate  the  reduction  in  the  root  gap 
(8.3%)  using  our  inequalities.  Similarly,  Table  2  compares  the  performance  of  the  algorithms  CD  and  UCD, 
to  illustrate  the  marginal  benefit  of  incorporating  our  inequalities  into  default  CPLEX,  where  we  observe  a 
reduction  in  the  root  gap  of  2.3%.  Due  to  the  reduction  in  the  integrality  gap  the  number  of  branch-and-cut 
nodes  is  almost  always  lower  for  UC  and  UCD  compared  to  BB  and  CD,  respectively.  The  solution  times 
and  the  number  of  unsolved  instances  are  slightly  lower  for  algorithms  that  include  our  proposed  inequalities. 
However,  the  end  gap  is  not  lower  for  algorithms  UC  and  UCD  compared  to  BB  and  CD,  respectively.  In 
conclusion,  our  preliminary  computational  results  show  that  our  proposed  inequalities  does  have  some  positive 
effects,  but  the  separation  heuristics  need  to  be  significantly  improved. 


34 


DISTRIBUTION  A:  Distribution  approved  for  public  release. 


Algorithm  2  Heuristic  separation  for  inequalities  (15) 

Input:  (x,  z) 

Output:  Sets  I  and  J  and  the  corresponding  cut  for  each  fractional  z 

I  <-V i 
s(V1\I)  =  0 
d(J)  =  0 

tempSupplies  4—  0 
tempDemand  4—  0 
switch  =  0 

for  all  the  fractional  variables  Zj  do 
tempDemand  =  {j} 

J={j} 

while  \tempDemand\  >  1  or  tempSupplies |  >  1  do 
if  switch  =  0  then 

for  all  the  supplies  i  that  have  an  edge  to  all  nodes  j  in  tempDemand  do 

if  Xij  >  0  then 

s(Vi  \  I)  <—  s(Vi  \  I)  +  Si 
tempSupplies  4—  tempSupplies  U  {i} 

end  if 
end  for 

switch  =  1 
tempDemand  0 

end  if 

if  switch  =  1  then 

for  all  demand  j  that  have  an  edge  to  all  nodes  i  in  tempSupplies  do 
if  Xij  >  0  then 
J  <-  J  U  {j} 
d(J)  4—  d(J)  +  dj 

tempDemand  tempDemand  U  {j} 

end  if 
end  for 

switch  =  0 
tempSupplies  4—  0 

end  if 
end  while 

if  d(J)  >  s(Vi  \  I)  and  |  J|  >  2  and  max.jej{dj}  >  d(J)  —  s(V \  \  I)  then 

if  E  iei,j£j:(i,j)£E  *ij  +  Eje  j(min{d(, J)  -  s(Vi  \  I),  dj})zj  <  d{J)  -  s(V i  \  I)  then 
add  inequality  (15)  with  /  and  J 

end  if 
end  if 

I  4-  V 1 

s(E  \  /)  =  o 

d(J)  =  0 
switch  =  0 

end  for 
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Algorithm  3  Heuristic  separation  for  inequalities  (27)  that  finds  |  J|  =  1 

Input:  ( x ,  z) 

Output:  Sets  I,  J,  J  and  the  corresponding  cut  for  each  fractional  z 

/,  J,  J  t-  0 
d(J\J)  =  0 
a  =  0 

for  all  the  fractional  variables  Zj  do 

J  {j},  J  {j} 

for  all  i  such  that  Xjj  >0  do 

for  all  j*  j  do 

if  ay.,*  =  d-j *  then 

d(J  \  J)  =  d(  J  \  J)  +  dj * 

end  if 
end  for 

a  =  Si  —  d(J  \  J) 

if  |  J|  >  2  and  J2jeJ.(i,j)eE  %ij  +  a*j  >  then 
add  inequality  (27)  with  /,  J,  J  and  a 

end  if 

/  <—  0,  J  {_)},  d(  J  \J)=  0 

end  for 
end  for 
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Algorithm  4  Heuristic  separation  for  inequalities  (27)  that  finds  |  J\  =  |  J\  —  1 

Input:  (x,  z) 

Output:  Sets  I,  J,  J  and  the  corresponding  cut  for  each  fractional  z 
Jo  <—  {j  £  Vi  :  Zj  =  0} 

J\  £-  {j  £  Vi  :  Zj  =  1} 

/  t—  0 
a  =  0 

maxdjJ  =  maxjgjjjdj} 

for  all  the  fractional  variables  Zj  do 

J  •*—  Ji  U  {j } 
if  maxdjJ  <  dj  then 
maxdjJ  =  dj 

end  if 

for  all  i  £  Vi  do 
for  all  j'  £  Jo  do 

if  Xjji  >  0  and  .s,  >  dj>  and  st  —  dj <  <  maxdjJ  then 
a  =  Sj  —  dj' 

I  £-  {•«},  J  ■£-  J  U  {/} 

if  j:[i.j;-  >  si  +  (1^1  -  l)a  then 

add  inecjuality  (27)  with  /,  J,  J  and  a 

end  if 
end  if 
end  for 
end  for 
end  for 


Algorithm  5  Heuristic  separation  for  inequalities  (27)  that  finds  general  J 

Input:  (x,  z) 

Output:  Sets  /,  J,  J  and  the  corresponding  cut 

Jf  t—  {j  £  V2  :  0  <  Zj  <  1} 

Ji  <s—  {j  £  V2  :  Zj  =  1} 

I  <-V L 

J^V2 
J  i —  J  f  U  J\ 
a  =  ° 

maxdjJ  =  ma,Xj£jlUjf{dj} 

if  s(Hi)  —  d(V 2  \  J)  >  0  and  s(Vi)  —  d(V 2  \  J)  <  maxdjJ  then 
a  =  s(V1)-d(V2\J) 

if  EiGVl,jev2:(i,j)GEXij+aJ2jejZj  >  s{V 1)  +  (|J|  -  l)a  then 
add  inequality  (27)  with  /,  J,  J  and  a 

end  if 
end  if 
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A  Proofs  of  Section  2 


In  this  section,  we  assume  that  all  data  are  integral. 

Proposition  1.  The  decision  version  of  TPMC  is  NP-complete  even  when: 

1.  Si  =  1  for  all  i  GV dj  =  d  >  3  for  all  j  £  V 2,  Wij  =  0  for  all  ( i,j )  €  E  and  rj  =  1  for  all  j  £  V2. 

2.  | Pi  |  =  1  and  w.ij  =  0  for  all  ( i,j )  G  E. 

Proof.  Since  TPMC  is  a  mixed  integer  linear  problem  with  rational  data,  it  is  in  NP.  We  present  two  reductions 
to  verify  the  two  parts  of  this  result. 

1.  We  reduce  every  instance  of  the  Exact  3-Cover  (E3C)  problem  to  an  instance  of  TPMC.  An  instance 
of  E3C  is  given  as:  Let  B  be  a  base  set  where  \B\  =3 q  for  some  q  G  N.  Let  C  be  a  collection  of  subsets 
of  B  where  each  subset  is  of  cardinality  3.  Does  there  exist  D  C  C  such  that  \D\  =  q  and  the  union  of 
sets  in  D  covers  every  element  of  B? 

It  is  well-known  that  E3C  is  strongly  NP-complete  [10].  Given  an  instance  of  E3C,  we  construct  an 
instance  of  TPMC  as  follows:  For  every  element  in  B ,  we  construct  a  node  in  Pi  and  for  every  element 
in  C  we  construct  a  node  in  V2.  For  i  G  Pl,  we  use  the  notation  B(i)  to  denote  the  element  of  B 
corresponding  to  node  i.  Similarly,  for  j  G  P2,  we  let  C(j)  denote  the  element  of  C  corresponding  to 
node  j.  We  add  an  edge  between  i  GV 1  and  j  G  V2  if  B(i)  G  C{j).  Let  Sj  =  1  for  all  i  G  Pi.  Let  dj  =  3 
for  all  j  €  P2.  Let  uqj  =  0  for  all  (z,  j)  G  E.  Let  =  1  for  all  j  G  V2. 

Next,  we  verify  that  there  exists  D  C  C  such  that  \D\  =  q  and  D  covers  every  element  of  B  if  and  only 
if  there  exists  a  feasible  solution  to  TPMC  with  a  cost  at  most  \C\  —  q.  Note  that  the  size  of  the  TPMC 
instance  is  polynomially  bounded  by  the  size  of  the  E3C  instance. 


(=>)  Assume  that  there  exists  {D\, . . . ,  Dq}  =:  D  C  C  such  that  D  covers  every  element  of  B.  Let  D(u) 
represent  the  element  of  D  (and  therefore  of  C)  that  contains  u  G  B.  Now  construct  the  solution 


1  if  B{i)  =  u  and  C(j)  =  D(u ) 
0  otherwise. 

1  ifC(j)?{Dly...,Dq} 

0  otherwise. 


It  is  straightforward  to  verify  that  ( x,z )  satisfies  all  the  constraints  of  TPMC  and  Yh(i,j)eE  wij&ij  + 

Eiev2  W  =  | Cl  -  q. 

(^=)  Consider  a  solution  (x,  z)  of  TPMC  such  that 


5Z  wijXij  +  Zj  <\C\~  q-  (58) 

(i,j)€E  jev2  j&V2 

Since  there  are  3 q  supply  nodes,  each  with  a  capacity  of  1,  the  demand  of  at  most  q  nodes  can  be 
satisfied.  Therefore,  from  (58),  we  conclude  that  there  are  exactly  q  nodes  whose  demands  are  satisfied. 
Let  D  =  {C(j)  |  J2ieVl  Zj  =  0}-  Clearly,  \D\  =  q  and  D  covers  every  element  of  B.  As  a  result  TPMC 
is  strongly  NP-complete. 
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2.  We  reduce  every  instance  of  the  Subset  Sum  (SS)  problem  to  an  instance  of  TPMC.  An  instance  of  SS 
is  given  as:  Let  A  be  a  finite  set,  an  £  Z+  be  the  size  of  each  element  n  £  A  and  B  be  a  positive  integer. 
Does  there  exist  a  subset  A  C  A  such  that  the  sum  of  the  sizes  of  the  elements  in  A  is  exactly  £?? 

It  is  well-known  that  SS  is  NP-complete  [10].  Given  an  instance  of  SS,  we  construct  an  instance  of 
TPMC  as  follows:  We  construct  a  single  node  V\  =  {1}  and  for  every  element  in  A  we  construct  a  node 
in  V2.  We  add  all  the  edges  between  the  nodes  in  V\  and  V2.  Let  the  single  supply  be  Si  =  B.  Let 
demand  of  market  j  be  dj  =  ctj  for  all  j  £  V2  =  A.  Finally,  let  the  unit  shipping  costs  and  lost  revenues 
be  Wij  =  0  and  rj  =  dj ,  for  j  £  V2 . 

Next,  we  verify  that  there  exists  subset  A  C  A  such  that  the  sum  of  the  sizes  of  the  elements  in  A  is 
exactly  B  if  and  only  if  there  exists  a  feasible  solution  to  TPMC  with  a  cost  of  at  most  a«  —  B. 

Note  that  the  size  of  the  TPMC  instance  is  polynomially  bounded  by  the  size  of  the  SS  instance. 

C  A  such  that  the  sum  of  the  sizes  of  the  elements  in 

a,j  if  j  £  A 
0  otherwise, 

1  if  j  i  A' 

0  otherwise. 

It  is  straightforward  to  verify  that  (x,z)  satisfies  all  the  constraints  of  TPMC  and  X(;  j)eEwij^ij  + 
52jev2  r3*j  =  XXeA  an  ~  B- 

(4=)  Consider  a  solution  (x,z)  of  TPMC  such  that 

H  H  rjZj  =  H  a3S3  —  Un  ~  B-  (59) 

(biles  j&v2  jev 2  neA 

The  total  demand  satisfied  by  any  feasible  solution  is  at  most  B  since  we  cannot  satisfy  more  than  the 
supply.  Furthermore,  since  each  edge  has  a  cost  per  unit  flow  of  0,  we  have  that  j)eE  wij%ij  =  0. 
Therefore,  from  (59),  the  total  demand  satisfied  must  equal  B.  Let  the  set  of  satisfied  demand  nodes 
be  A  =  {j  £  A  :  2j  =  0},  so  we  have  XXeA'  a«  =  B. 


(=>)  Assume  that  there  exists  a  subset  A 
A  is  exactly  B.  Now  construct  the  solution 


□ 

Proposition  2.  Suppose  that  dj  <  2  for  all  j  £  V2.  Then  there  exists  a  polynomial-time  algorithm  to  solve 
TPMC. 

Proof.  We  can  convert  a  given  instance  of  TPMC  with  dj  <  2  for  all  j  £  V2  and  arbitrary  supplies  into  an 
equivalent  instance  with  all  supplies  equal  to  1.  Observe  that  in  any  feasible  solution  since  dj  <  2  for  all 
j  £  V2,  no  supply  can  send  more  than  2 1 V2 1  units.  Therefore,  if  s,  >  1  for  some  i  £  V),  then  we  construct  an 
updated  instance  by  replacing  supply  node  i  £  V\  with  min{si,  2 1 V2 1 }  supply  nodes  with  a  capacity  of  1  and 
unit  shipping  cost  to  demand  node  j  of  vjtj  for  (i,j)  £  E.  Notice  that  the  resulting  instance  is  polynomial  in 
the  size  of  the  original  problem.  Therefore  from  now  on  we  assume  that  s*  =  1  for  all  i  £  V\. 

We  show  that  TPMC  with  dj  <  2  for  all  j  £  V2  is  equivalent  to  the  problem  of  finding  a  minimum  weight 
perfect  matching  on  a  suitably  constructed  general  graph  G'  =  {V ,  E'). 

1 .  For  each  i  £  V) ,  we  add  a  corresponding  i  £  V 1  and  similarly  for  each  j  £  V2  we  add  j  £  V' .  (When  we 
use  notation  V\  C  V' ,  Vi  represents  the  vertices  of  V'  corresponding  to  the  vertices  V\  of  G ;  similary 
for  V2.) 

2.  Let  M\  =  {j  £  V2  :  dj  =  1}  and  M2  =  {j  £  V2  :  dj  =  2}. 
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3.  For  each  demand  node  j  G  V2,  add  a  node  j'  G  V'  (note  that  this  is  in  addition  to  j  G  V'  for  j  G  V2 

as  described  in  1.).  Add  an  edge  (j,f)  G  E'  with  a  cost  of  rj.  We  refer  to  the  set  of  nodes  j'  G  V' 

corresponding  to  j  G  Mi  as  M[.  (We  define  M 2  similarly.) 

4.  For  each  i  G  V\  such  that  (i,j)  G  E  and  j  G  Mi,  add  the  edge  (i,  j)  G  E1  with  cost  of  'Wl3 . 

5.  For  each  i  G  Vj  such  that  (i,j)  G  E  and  j  G  M2,  add  two  nodes,  ijl,  ij2  G  V .  Add  edges 

(Mi  1),  (u'l,*j2),  (ij2,j),  (ij2,j')  G  E'  with  costs  0,  "  N ,  ^  respectively. 

6.  If  V]  is  odd,  we  add  an  additional  artificial  node  {0}  to  V’ .  Let  V{  C  V’  be  defined  as  V{  :=  V\  U  M[ 

if  |Vi|  is  even  and  V{  :=  Vi  U  M[  U  {0}  if  \V\\  is  odd. 

7.  For  all  u,v  G  V[  such  that  add  an  edge  (u,v)  G  E'  with  a  cost  of  0.  Therefore,  the  subgraph 

induced  by  the  nodes  in  V[  is  a  complete  graph/clique. 

Note  that  the  size  of  the  resulting  minimum  weight  perfect  matching  problem  is  polynomial  in  the  size  of  the 
TPMC  problem.  Figure  2  illustrates  the  original  graph  of  a  TPMC  instance,  where  the  demand  of  market  A 
is  2  and  that  of  market  B  is  1.  Figure  3  illustrates  the  new  graph.  (The  clique  induced  by  V\  U  {B'}  U  {0}  is 
not  shown.) 


V, 


v2 


rA 


WIA 


Figure  2:  A  TPMC  instance  Figure  3:  Construction  of  G' 

We  next  show  that  any  solution  to  the  TPMC  problem  corresponds  to  a  perfect  matching  in  G'  =  (V7,  E'). 
Consider  a  feasible  solution  (x,  z )  to  the  TPMC  problem.  If  Zj  =  0  for  j  G  Mi,  then  there  exists  exactly  one 
supply  node  i  such  that  Xjj  =  1.  For  constructing  a  matching  in  G' ,  we  choose  edge  where  i  G  V 1  and 

j  G  Mi,  thereby  covering  nodes  i  and  j  in  V' .  If  Zj  =  0  for  j  G  M2,  then  there  exists  two  supply  nodes  ii  and 
*2  G  Vi  such  that  x^j  =  Xi2j  =  1.  For  constructing  a  matching  in  G' ,  without  loss  of  generality,  we  choose 
edges  (ii,iijl),  (iij2,j),  1)  and  (i2j2,j'),  thereby  covering  nodes  ii,i2,iijl,iij2,i2jl,i2j2,j,f.  If 

Zj  =  1  for  j  G  V21  then  no  supply  node  i  sends  demand  to  j  and  for  the  matching  we  choose  edge 
hence  covering  nodes  j  and  j'  in  V' .  Moreover  if  j  G  M2,  we  choose  edges  (ijl,ij2)  for  all  (i,j)  G  E,i  G  Vi 
in  the  matching  and  therefore  the  nodes  ijl,  ij 2,  j,  j'  are  also  covered.  Hence  whether  Zj  =  1  or  z3  =  0,  and 
whether  j  G  Mi  or  j  G  M2,  the  nodes  in  V2,  M2,  and  the  nodes  ijl,ij2  for  all  (i,  j)  G  E,j  G  M2  are  always 
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covered  by  the  edges  in  the  matching  we  have  selected  thus  far.  To  complete  the  proof  we  show  how  nodes 
i  £  V(  are  also  covered  in  all  cases  by  extending  the  matching  we  have  until  now. 

Let  M\  =  {j  £  Mi  :  Zj  =  0},  M2  =  {j  £  M2  :  Zj  =  0}  and  Vi  =  {i  £  Vi  :  Xij  =  1}.  In  other  words,  set  Mi 
represents  the  nodes  j  £  Mi  whose  unit  demands  are  satisfied,  set  M2  represents  the  nodes  j  £  M2  whose 
demands,  dj  =  2,  are  satisfied,  and  set  Vi  represents  the  set  of  supply  nodes  that  send  demand.  Observe  that 
the  nodes  in  Vi  are  also  covered  in  the  matching  constructed  thus  far.  However,  the  nodes  j  €  Vi  \  Vi,  and 
j'  €  M[  for  j  £  Mi  and  {0}  (if  it  exists)  are  not  yet  covered.  Note  that  |Vi|  =  \Mi\  +  2|M2|.  We  consider 
two  cases. 

1.  |Vi|  is  even.  If  |Vi|  is  even,  then  \Vi\  —  \Vi\  and  \Mi\  are  even.  If  |Vi|  is  odd,  then  \Vi\  —  |Vj|  and  \Mi\ 

are  odd.  Therefore,  |Vi|  —  \Vi\  +  |Mi|  is  always  even.  Thus,  we  can  cover  all  nodes  i  £  V)  \  Vi  and 

j'  £  M[  for  j  £  Mi  using  \Vl\~\vd+\Mi\  marLy  disjoint  edges  that  exist  between  them  (recall  that  the 
subgraph  induced  by  the  nodes  i  €  V{  form  a  complete  graph). 

2.  |Vi|  is  odd.  If  | Vi |  is  even,  then  | Vf |  —  |Vi|  is  odd  and  |Mi|  is  even.  If  | Vi |  is  odd,  then  \Vi\  —  |Vi|  is 

even  and  \Mi\  is  odd.  Therefore,  | Vi |  —  |Vj|  +  \Mi\  is  always  odd.  Recall  that  when  \Vi\  is  odd  we 

have  an  additional  dummy  node  {0}  that  forms  a  fully  connected  graph  with  nodes  i  £  Vi  and  j  £  M[. 
Therefore,  we  obtain  an  even  number  of  nodes  that  need  to  be  covered  by  choosing  lyi|-|Vitf |AJi|+i 
disjoint  edges. 

So  we  have  verified  that  given  any  solution  to  the  TPMC  problem  we  can  find  a  perfect  matching  in  G'  = 
(V',E').  Moreover,  it  is  straightforward  to  check  that  the  cost  of  this  matching  is  equal  to  the  cost  of  the 
given  solution  to  TPMC. 

Next  we  show  that  any  solution  to  the  perfect  matching  in  G 1  =  ( V' ,  E’)  corresponds  to  a  feasible  solution 
of  the  TPMC  problem.  Let  P  be  the  set  of  edges  that  are  in  the  perfect  matching.  If  edge  ( j',j )  £  P  for 
j'  £  M[,  j  £  Mi  (or  j  £  M2,  j'  £  M'2),  then  set  Zj  =  1.  Set  all  remaining  zj  =  0.  If  edge  ( i,j )  £  P  for 
j  €  Mi,  then  we  set  xi3  =  1.  If  edge  ( i ,  ij  1)  £  P,  then  set  X{3  =  1.  Set  all  remaining  xi3  =  0.  Note  that  due  to 
the  construction  of  graph  G',  a  supply  node  i  £  V i  can  send  at  most  1  unit  of  demand.  Similarly  for  j  £  Mi 
a  single  edge  that  has  j  as  one  of  its  endpoints  will  be  selected.  For  j’  £  M2,  j  £  M2  if  edge  (j,  jr)  £  P,  then 
for  any  i  £\ \  edges  (' ij2,j ),  (ij2,j')  £  P.  However,  if  edge  P  then  for  a  perfect  matching  there  must 

exist  exactly  two  ii,i2  £  Vi  such  that  (iij2,j),  (i2j2,  f  )  £  P.  Therefore,  for  any  j  £  M2  either  the  demand  is 
fully  satisfied  or  it  is  rejected  altogether.  Finally,  it  is  easy  to  see  that  the  cost  of  the  solution  to  the  TPMC 
problem  is  equivalent  to  the  cost  of  the  corresponding  perfect  matching  in  G',  completing  the  proof. 

□ 
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Abstract 

We  propose  a  scenario  decomposition  algorithm  for  stochastic  0-1  programs.  The  algorithm  recovers 
an  optimal  solution  by  iteratively  exploring  and  cutting-off  candidate  solutions  obtained  from  solving  sce¬ 
nario  subproblems.  The  scheme  is  applicable  to  quite  general  problem  structures  and  can  be  implemented 
in  a  distributed  framework.  Illustrative  computational  results  on  standard  two-stage  stochastic  integer 
programming  and  nonlinear  stochastic  integer  programming  test  problems  are  presented. 


1  Introduction 


We  consider  stochastic  programs  of  the  following  form 

min{E[/(x, £)]  :  x  £  X  C  {0, 1}”},  (1) 

where  £  is  a  random  vector  with  support  E  and  known  distribution  P,  and  the  expectation  in  (1)  is  with 
respect  to  P.  An  important  example  of  (1)  is  the  class  of  two-stage  stochastic  programs  with  0-1  first  stage 
variables  with 

f(x,0  =  cT x  +  min{(/>(y(£),  £)  :  y(£)  €  F(£,a;)} 

where,  for  realization  £  of  £,  y(£)  is  the  second  stage  decision  vector,  </>(•,£)  is  the  second  stage  objective 
function,  and  Y (£,  x)  is  the  second  stage  constraint  system  depending  on  the  first  stage  decision  vector 
x.  We  assume  that  the  random  vector  £  has  a  finite  support,  i.e.  5  =  {£*, . . . .  £ N } ,  where  each  £*  for 
i  £  {!.....  A? }  i s  referred  to  as  a  scenario.  We  can  then  rewrite  (1)  as 


min 


N 


£/<(*):  seA-CjO,!}" 


.  i= 1 


(2) 


where  fi(x)  =  Pif(x ,  £*)  and  p,  is  the  probability  mass  associated  with  scenario  i. 

A  popular  approach  for  solving  (2)  is  the  so-called  scenario  or  dual  decomposition  method.  By  making 
copies  of  the  decision  variables  x,  problem  (2)  can  be  reformulated  as 


{N  N 

fi(x')  :  x1  £  X  V  i,  ^  AiX1  =  h 
*= 1  »= 1 

where  the  equations  XJili  -^tx1  =  h  enforce  the  nonanticipativity  constraints  x 1  =  •••  =  xN .  The  La- 
grangian  dual  problem  by  dualizing  these  nonanticipativity  constraints  take  the  form 


(N. 

max  <  v(X)  :=  ^  min  {  fi(xz)  +  X1  AiX1 

A  l  l=i 


cl  £  X}  —  XTh 


(3) 
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where  A  is  the  dual  vector.  For  any  A  the  value  v(  A)  provides  a  lower  bound  to  (2)  which  can  be  evaluated  by 
separately  solving  subproblems  corresponding  to  each  scenario.  The  best  possible  such  bound  is  given  by 
the  optimal  value  of  the  dual  problem  (3)  which  is  a  large-scale  and  nonsmooth,  albeit  convex,  optimization 
problem  and  is  in  general  quite  challenging.  Even  with  an  optimal  dual  solution,  owing  to  the  presence  of 
integer  decision  variables,  there  typically  remains  a  duality  gap,  and  moreover  a  primal  feasible  solution 
(one  that  satisfies  the  original  and  the  nonanticipativity  constraints)  is  not  readily  available.  Caroe  and 
Schultz  [5]  proposed  a  branch  and  bound  algorithm  where  the  dual  problem  (3)  is  used  to  generate  lower 
bounds,  the  integer  feasible  solutions  to  the  scenario  subproblems  are  "averaged"  to  generate  a  possibly 
fractional  primal  solutions,  and  the  feasible  region  is  successively  partitioned  by  branching  on  the  fractional 
variables  to  enforce  integrality.  Alonso-Ayuso  et.al.  [2]  propose  solving  scenario  subproblems  (with  A  = 
0)  with  a  branch-and-fix  approach  that  coordinates  the  selection  of  the  branching  nodes  and  branching 
variables  in  order  to  enforce  the  nonanticipativity  constraints.  A  number  of  heuristics  [6,  9,  15]  have  also 
been  designed  based  on  the  scenario-wise  decomposability  of  the  dual  problem  (3).  In  these  approaches  the 
dual  problem  is  augmented  by  an  additional  penalty  to  achieve  consensus  among  the  scenario  subproblem 
solutions,  and  the  penalty  parameters  are  iteratively  updated  to  try  to  achieve  primal  feasible  solutions.  No 
convergence  guarantees  are  available  for  such  approaches,  but  they  are  reported  to  have  good  performance 
in  various  applications. 

In  this  paper  we  propose  a  scenario  decomposition  algorithm  for  (2)  that  proceeds  by  solving  scenario 
subproblems  to  generate  candidate  solutions  and  lower  bounds,  evaluating  candidate  solutions  to  get  up¬ 
per  bounds,  and  cutting  off  candidate  solutions  from  the  scenario  subproblems  to  get  improved  lower 
bounds  and  new  candidate  solutions.  The  scheme  is  applicable  for  quite  general  problem  structures  and, 
since  unlike  existing  exact  scenario  decomposition  methods  requires  little  coordination  among  scenario 
subproblems,  can  be  implemented  easily  in  a  distributed  framework.  The  remainder  of  the  paper  is  orga¬ 
nized  as  follows:  in  Section  2  we  describe  the  proposed  scenario  decomposition  method  along  with  some 
implementation  issues,  in  Section  3  we  discuss  some  justification  for  exploring  solutions  to  scenario  sub¬ 
problems  as  candidate  solutions  for  the  overall  problem,  and  finally  in  Section  4  we  present  some  computa¬ 
tional  results  using  a  distributed  implementation  of  the  proposed  method  on  standard  two-stage  stochastic 
integer  programming  and  nonlinear  stochastic  integer  programming  test  problems. 


2  A  Scenario  Decomposition  Algorithm 

We  propose  a  simple  scenario  decomposition  algorithm  that  exploits  the  scenario-wise  decomposability  of 
(3)  and  the  0-1  nature  of  the  variables.  The  algorithm  explores  solutions  to  the  scenario  subproblems  as 
candidate  primal  feasible  solutions  to  the  overall  problem.  The  explored  solutions  are  then  cut-off  from 
future  consideration  in  all  subproblems.  This  allows  for  an  improvement  in  the  lower  bound  by  restricting 
the  feasible  region  and  eventually  closing  the  duality  gap. 

The  overall  scheme  is  outlined  in  Algorithm  1.  In  the  lower  bounding  step  (lines  3-12),  the  scheme  uses 
the  decomposable  dual  problem  (3)  to  generate  lower  bounds,  and  a  set  of  candidate  solutions.  Note  that 
we  have  not  exactly  specified  how  to  update  the  dual  solutions  A  since  the  general  scheme  is  independent 
of  this.  In  fact,  the  scheme  is  valid  without  updating  A  at  all.  In  the  upper  bounding  step  (lines  13-22) 
each  of  the  candidate  solutions  is  evaluated  and  the  best  of  these  provide  an  upper  bound  and  becomes  the 
incumbent  solution  x*.  A  key  feature  of  the  scheme  is  that  the  evaluated  set  of  solutions  is  discarded  from 
the  set  of  feasible  solutions  (note  the  set  X  \  S  in  line  7).  Because  of  the  0-1  nature  of  the  solutions  this  can 
be  easily  accomplished  by  adding  "integer  cuts"  of  the  form 

(X  -  xi )  +  Xj>l  VxgS 

j-.Xj—l  j-Xj—0 

to  the  original  constraints  X  of  the  problem.  Recently  stronger  formulations  for  discarding  a  given  set  of 
0-1  solutions  have  been  proposed  which  can  also  be  used  in  this  context  [3]. 

Proposition  1  Assuming  that  solving  a  scenario  subproblem  (step  7)  and  evaluating  the  objective  function  (step  17) 
requires  finite  time,  Algorithm  1  (scenario  decomposition)  terminates  in  a  finite  number  of  iterations  returning  an 
optimal  solution  or  detecting  infeasibility. 
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Algorithm  1  Scenario  Decomposition 

l .  UB  < — boo,  LB  < - oo,  S  =  0,  x*  4—  0 

2:  while  UB  >  LB  do 

3:  Lower  bounding: 

4:  A  4-  0 

5:  while  some  termination  criteria  is  not  met  do 

6:  for  i  =  1  to  N  do 

7:  solve  min{  fi(x)  +  XT Ai.x  :  x  £  X  \  S} 

8:  let  Vi  be  the  optimal  value  and  xl  be  an  optimal  solution 

9:  end  for 

10:  update  A 

ll:  end  while 

12:  LB  4-  vi  -  xTh'  S  =  and  S  <-  S  U  S 

13:  Upper  bounding: 

14:  for  X  €  S  do 

15:  Mf-  0 

16:  for  i  =  1  to  N  do 

17:  compute  fi(x)  and  set  u  4—  u  +  fi  (x) 

18:  end  for 

19:  if  UB  >  u  then 

20:  UB  4—  u  and  x*  4—  x 

21:  end  if 

22:  end  for 

23:  end  while 


Proof:  If  the  problem  is  infeasible  the  lower  bound  is  set  to  +oo  and  the  algorithm  terminates  after  the  first 
iteration.  Otherwise,  finite  termination  of  the  algorithm  follows  from  the  fact  that  the  feasible  region  gets 
strictly  smaller  in  each  iteration  and  so  the  lower  bound  is  nondecreasing  (the  lower  bound  is  +oo  when 
the  feasible  region  becomes  empty).  Moreover  since  no  solution  is  ever  discarded  without  evaluation,  it 
follows  from  the  finiteness  of  the  solution  set  that  the  algorithm  returns  an  optimal  solution  in  a  finite  num¬ 
ber  of  iterations.  □ 

The  proposed  scenario  decomposition  algorithm  presumes  that  we  have  exact  oracles  to  solve  scenario 
subproblems  of  the  form: 

min{/j(a;)  +  A TAfx  :  x  £  X  \  S} 

for  any  A  and  S,  and  to  evaluate  the  objective  function  ffx)  for  a  given  solution.  These  are  similar  to  the 
assumptions  made  for  other  decomposition  schemes  based  on  the  dual  problem  (3).  For  two-stage  stochas¬ 
tic  integer  programs  these  steps  require  solving  many  single  scenario  integer  programs  and  can  present 
significant  bottleneck.  Of  course  instead  of  exact  computations,  we  can  compute  the  scenario  subproblems 
and  objective  function  to  some  pre-specified  tolerance,  use  the  best  lower  bound  in  the  lower  bounding 
step  and  the  best  upper  bound  in  the  upper  bounding  step,  and  terminate  the  algorithm  after  the  lower  and 
upper  bounds  are  within  tolerance.  Also,  note  that  in  the  upper  bounding  step  when  evaluating  a  solution 
x  we  do  not  need  to  consider  the  scenarios  for  which  this  solution  was  optimal  in  the  lower  bounding  step 
since  the  corresponding  objective  function  values  are  already  available.  Significant  computational  benefits 
can  be  obtained  by  using  relaxations  of  the  scenario  subproblems.  Let  /  Ax)  be  a  relaxation  of  fi(x),  i.e. 
/.( x)  <  fi(x)  for  all  x  £  X.  For  two-stage  stochastic  integer  programs  such  a  relaxation  could  be  obtained 
by  relaxing  the  integrality  requirements  on  the  second-stage  variables.  It  is  valid  to  use  /  .( x)  in  the  lower 
bounding  step.  The  upper  bounding  step  can  be  modified  as  follows:  for  a  given  candidate  solution  x  first 
evaluate  u  :=  ^2^=1  /  (x)  and  if  u  >  UB  then  we  do  not  need  to  further  consider  x  since  a  better  solution 
is  already  known;  on  the  other  hand  if  u  <  UB  then  we  need  to  evaluate  u  =  fi(x)  as  usual  before 
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x  can  be  cut-off  from  further  consideration.  We  could  also  evaluate  and  discard  only  a  small  subset  of  the 
solutions  found  rather  than  considering  all  the  scenario  subproblem  solutions  S.  This  would  reduce  the 
effort  within  a  main  iteration,  perhaps  at  the  expense  increasing  the  number  of  main  iterations. 

An  attractive  feature  of  the  proposed  scheme  is  that  there  is  very  little  interaction  between  the  scenario 
subproblems  in  the  lower  and  upper  bounding  steps.  Accordingly  the  scheme  can  be  easily  implemented 
in  a  distributed  framework.  Each  processor  can  be  assigned  a  subset  of  scenarios  and  would  essentially  run 
through  Algorithm  1  independently  except  for  broadcasting  and  collecting  solutions  and  objective  values 
after  solving  scenario  subproblems  in  step  7  and  after  evaluating  objective  functions  in  step  17.  Note  that  if 
a  subgradient  algorithm  (cf.  [4])  is  used  to  update  the  dual  solutions  A  where  the  update  only  depends  on 
the  scenario  subproblem  solutions  and  objective  values,  then  each  processor  can  locally  update  their  copies 
of  the  dual  solutions  after  exchanging  solutions  to  the  scenario  subproblems  in  step  7,  and  these  local 
dual  solutions  will  be  identical.  In  Section  4  we  provide  computational  results  using  such  a  distributed 
implementation. 


3  Optimality  of  scenario  solutions 

The  algorithm  proposed  in  the  previous  section  explores  solutions  to  the  scenario  subproblems  as  candi¬ 
dates  for  solutions  to  the  overall  problem.  This  is  a  departure  from  other  methods  based  on  solving  the  dual 
problem  (3)  where  candidate  solutions  are  generated  by  aggregating  solutions  to  the  scenario  subproblems. 
In  this  section  we  attempt  to  provide  some  rationale  why  the  solutions  to  the  scenario  subproblems  them¬ 
selves  can  be  good  solutions  to  the  overall  problem  in  the  case  the  problem  of  interest  is  a  sample  average 
approximation  problem. 

Consider  a  0-1  stochastic  program 

(P):  min{E[/(*,C)]:  x€lC{0,l}"}, 

and  its  sample  average  approximation 


(SAAn)  :  min 


'  '  '  ■ 


corresponding  to  an  iid  sample  {£*}^Li-  The  scenario  subproblem  corresponding  to  the  i-th  scenario /sample 
is 

(Pi)  ■  min{/(x,f)  :  x  e  X  C  {0,1}”}  • 

Let  Sn  be  the  set  of  optimal  solutions  of  (SAAn),  and  .S',  be  the  set  of  optimal  solutions  of  ( P, )  for  i  = 
1 ,N.  Note  that  these  are  random  sets  since  they  depend  on  the  sample  drawn.  We  investigate  the 
following  question: 

What  is  the  probability  that  the  set  of  solutions  to  one  of  the  scenario  problems  (Pi)  contain  a 
solution  to  (SAAn)? 

More  precisely  we  would  like  to  estimate  Pr[,S',v  fl  (U!=i  <?*)  /  0]  and  understand  its  dependence  on  X  and 
N. 

Trivially,  if  X  <  2  and  N  >  2  then  the  above  probability  is  1.  On  the  other  hand,  it  is  not  hard  to 
construct  examples  where  this  probability  is  arbitrarily  small.  For  example,  suppose  X  =  {a;1,  a;2,  a:3},  £ 
has  two  realizations  /  '  and  £2  with  equal  probability,  and  the  values  of  /( x,  £)  are  as  in  Table  1.  Then  for  N 
reasonably  large  Sn  —  {ad}  with  very  high  probability  while  &i  =  {®2, a;3}  with  probability  one. 

In  the  following  we  show  that  if  the  collection  of  random  objective  functions  {/( x,0}iex  of  (P)  is 
jointly  normal  with  a  nonnegative  and  strictly  diagonally  dominant  covariance  matrix  then 


lim  Pr 

N—>oo 


N 


Sn  H  (^J  Si)  ^  0 


=  1 
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Solution 

e1 

x1 

0 

0 

X 2 

-l 

2 

X3 

2 

-1 

Table  1:  Values  of  /(&,£)  for  an  example 


exponentially  fast.  Before  proceeding  with  a  proof  of  the  above  result  we  make  some  remarks.  The  con¬ 
dition  of  nonnegative  covariance  matrix  implies  that  the  objective  functions  at  different  solutions  are  non- 
negatively  correlated,  and  therefore,  perhaps  it  is  not  very  surprising  that  a  scenario  solution  is  likely  to  be 
optimal  for  the  sample  average  problem.  Recall  that  a  matrix  is  strictly  diagonally  dominant  if  the  diagonal 
element  in  a  row  is  greater  than  the  sum  of  the  absolute  values  of  all  other  elements  in  that  row.  This  con¬ 
dition  can  be  ensured  by  adding  independent  zero-mean  normal  noise  to  the  objective  function  value  for 
each  solution  without  changing  the  optimal  solution  or  optimal  value  of  (P).  We  will  need  the  following 
Lemma. 

Lemma  1  Let  W  be  a  K  +  1  dimensional  Gaussian  random  vector,  W  ~  N(p,  E)  with  the  following  properties: 

1.  For  some  e  >  0,  p0  <  pk  +  efor  all  k  =  1, . . . ,  K  and 

2.  the  covariance  matrix  E  is  nonnegative  and  strictly  diagonally  dominant. 

Condition  2  implies  that  there  exists  6  such  that  0  <  52  :=  min{cr2  —  au,  erf  —  cru}-  Then 


Pr[W0  <Wk\/k=l,...,K\>  {$(-V2e/8))K, 


where  $  is  the  standard  normal  cdf. 

Proof:  Since  po  <  pk  +  e  f°r  all  k  =  1, . . . ,  K,  we  have 

Pr[Wo  <  Wk  Vfc  =  1, ... ,  K]  >  Pr[(W0-p0)-(Wk-pk)<-e\/k  =  l,..,fK}. 

Let  Uk  :  =  ( W0 —po)—(Wk—Pk)iork  =  1 , . . . ,  K,  and  note  that  U  is  a  K  dimensional  Gaussian  random  vector 
with  E[[/fc]  =  0,  E[[/f]  =  CTq  +  erf  -  2aok  >  2S2,  and  E [UkUi]  =  erf  -  er0i  -  er0k  +eJki>  0  for  all  k,  l  =  1, . . . ,  K. 
Consider  another  K  dimensional  Gaussian  random  vector  V  with  E[!4]  =  0,  E[Vj?]  =  E [Uf  \  and  E[VfcVj]  =  0 
for  all  k,  l  =  1, . . . ,  K.  Since  E[VfcVj]  <  E[UkUi]  for  all  k,  l  =  1, . . . ,  K,  by  Slepian's  inequality  [14] 

K 

Pr[Wo  <  Wk  Vfc]  >  Pv[Uk  <  —  e  Vfc]  >  Pr[l4  <  —  e  Vfc]  =  ]^[  Pr[T4  <  —  e], 

fe= l 


where  the  last  identity  follows  from  independence.  Since  EfVj.2]  >  2<52.  Hence  for  any  k, 


Pr[14  <  —  e]  =  Pr 


Vk/yjw[}  <  -e/^/E\v[i\  >  Pr  Wk/^/nV2}  <  -V2e/8 


=  ${-V2  e/6) 


and  the  result  follows.  □ 

Proposition  2  If  the  collection  of  random  objective  functions  {f(x,  £)}a;ex  of  (P)  is  jointly  normal  with  a  nonneg¬ 
ative  and  strictly  diagonally  dominant  covariance  matrix  then 

r  n  i 


lim  Pr 

N—>oo 


Sn  n  ([J  Si)  f  0 


=  l 


exponentially  fast. 
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Proof:  Pick  e  €  (0, 1)  and  let  SI  be  the  set  of  e-optimal  solutions  to  (P),  i.e.  S*  =  { x  £  X  :  E[/(a;,  £)]  < 
E [f(x,  £)]  +  e  \/y  €  X}.  Note  that, 

n  1  r  N  1  N 

Pr  Sn  n  (U  Si)  f  0  >  Pr  [SI  c  (U  Si)]  A  [SN  C  S*e]  >  Pr[5*  C  (U  #)]  +  Pr [SN  C  SI]  -  1, 

2=1  J  L  2=1  J  2=1 

where  the  last  inequality  is  using  Boole's  inequality.  From  SAA  theory  [8]  we  already  know  that 

—  N  e<2 

Pr[5;v  C  SI]  >  (1  -  \X\e 

where  =  maxa;ex[V[/(a:,  £)].  Consider  x*  €  S%,  then 

Pr[^C(Utr^)^0]  >  Pr[x*  e  (Uti  ^)] 

=  1  -  rtli  Pr[®*  £  Si] 

=  l-nti(l-P  r[x*€Si\) 

=  l-nf=i(l-  P <  f(y,0  v  y  €  X  \  {**}]). 

Applying  Lemma  1  with  IT  =  \X  \  {x*}\,Wq  =  f(x*,£)  and  Wk  =  f(y,  £)  for  y  G  X  \  {x*},  we  get 

Pr  [/(**,  0  <  f(v,t)VyeX\{x*}]  >  (<t>(-V2e/26))^, 
for  some  <5  >  0  depending  on  the  covariance  matrix  of  {f(x,  Thus 

JV  1  e2 

Pr  SN  n  (|J  Si)  ^  0  >  (l-(l-($(-v/2e/(5))l-Yl)iv)-|X|e^Ar^.  (4) 

2=1 

Since  $(—\/2e/(f)  >  0  for  <5  >  0,  taking  limits  with  respect  to  A  on  both  sides  of  (4)  we  get  the  desired  result. 

□ 

We  close  this  section  with  some  remarks  on  Proposition  2.  Note  that  when  e  is  very  small  relative  to  S, 
$(—\/2e/5)  «  1/2  and  so  the  right  hand  side  of  (4)  is  approximately  (1  —  (1  —  (1/2)1  X\)N)  —  \X\e~NC  where 
C  is  a  small  constant.  This  indicates  that  smaller  the  solution  set,|X|,  relative  to  the  sample  size  N,  higher  is 
the  probability  that  one  of  the  scenario  solutions  is  optimal.  If  \X\  is  large  relative  to  N,  then  the  bound  in  (4) 
is  very  loose,  even  negative.  Consequently  inequality  (4)  does  not  lead  to  any  meaningful  finite  sample  size 
estimates,  but  implies  asymptotic  behavior  with  respect  to  N  for  fixed  X.  Proposition  2  can  be  extended 
to  more  general  distributions  under  different  assumptions.  For  example,  if  the  random  vector  {/(#*,  £)  — 
f(y ,  DlygA'Ua;*}  (where  x*  is  an  e-optimal  solution  of  (P))  has  a  density  and  contains  a  ball  centered  at  0  of 
radius  at  least  e  in  its  support,  then  there  exists  p  >  0  such  that  Pr  [/(**,£)  <  f(y ,  £)  V  y  €  X  \  {x*}]  >  p. 
We  can  then  replace  ($(— \/2  e/8)fx^  with  p  in  (4)  and  the  conclusion  of  Proposition  2  holds. 

4  Computational  Results 

In  this  section  we  report  some  computational  results  using  a  distributed  implementation  of  the  proposed 
scenario  decomposition  to  solve  a  class  of  two-stage  stochastic  integer  programs  and  a  class  of  nonlinear 
stochastic  programs.  Our  implementation  uses  CPLEX  12.5  as  the  MIP  solver  (in  single  thread  mode)  and 
openMPI  [7]  for  parallelization.  We  use  a  barrier  synchronization  step  after  solving  the  scenario  subprob¬ 
lems  and  evaluating  objective  function  across  the  processors  before  coordinating  the  scenario  solutions  and 
optimal  values.  Preliminary  experiments  with  the  subgradient  method  to  update  the  dual  solutions  A  did 
not  provide  significant  improvement  in  the  lower  bounds.  Consequently,  in  the  current  implementation 
we  use  A  =  0  and  do  not  update  the  dual  solutions.  We  use  an  absolute  optimality  tolerance  of  10 
in  the  overall  algorithm  and  in  solving  the  MIP  subproblems,  and  impose  a  time  limit  of  5000  seconds. 
All  computations  are  done  using  the  boyle  cluster  in  the  ISyE  High  Performance  Computing  Facility 
(http://www.isye.gatech.edu/computers/hpc/).  These  constitute  a  set  of  16  identical  machines 
each  with  8  Intel  Xeon  2.66GHz  chips  and  8  GB  RAM  running  Linux.  We  limit  our  parallelization  to  32 
processors  since  this  is  the  current  limit  on  the  cluster. 
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4.1  Two-stage  stochastic  integer  programming  instances 

Our  first  set  of  results  correspond  to  the  sslp  problem  instances  available  in  the  SIPLIB  library  at  http : 
/  /www2  .  isye  .  gatech  .  edu/  -sahmed/siplib/.  These  are  a  set  of  10  two-stage  stochastic  integer  pro¬ 
gramming  instances  from  a  telecommunications  application  [11],  The  deterministic  equivalent  of  the  largest 
of  these  problems  has  over  1000000  binary  variables.  In  our  implementation  we  use  continuous  relaxation 
of  the  second  stage  variables  in  the  lower  bounding  step  as  explained  in  Section  2  . 

Table  2  presents  the  computational  results.  The  first  column  is  the  name  of  the  instance  in  the  form 
sslp_n-|  Jri2-N  where  n-\  is  the  number  of  first  stage  variables  (all  binary),  n-j  is  the  number  of  second 
stage  variables  (mixed),  and  N  is  the  number  of  scenarios.  Columns  2-6  present  the  optimal  value,  the 
number  of  main  iterations,  total  number  of  solutions  explored,  and  the  total  solution  time  using  1  and 
32  processors  (except  for  the  three  sslp  15  45  instances  where  the  number  of  scenarios  are  less  than  32; 
in  this  case  the  number  of  processors  used  is  equal  to  the  number  of  scenarios).  Each  reported  solution 
time  is  the  average  of  3  independent  runs  to  account  for  the  load  variability  in  the  computing  cluster.  In 
each  of  the  10  instances  an  optimal  solution  is  found  in  the  first  iteration,  i.e.  one  of  the  solutions  to  the 
scenario  subproblems  was  optimal.  In  Figure  1  we  present  speedup  of  the  proposed  distributed  scenario 
decomposition  algorithm  with  respect  to  the  number  of  processors  for  the  instance  sslp_10_5  0_5  00.  The 
appearance  of  super-linear  speedup  is  due  to  the  fact  that  the  run  times  are  averages  of  3  independent  runs, 
and  perhaps  also  due  to  the  cache  effects.  The  parallel  solution  times  are  significantly  smaller  and  serial 
solution  times  are  competitive  in  comparison  to  the  times  reported  in  the  literature  for  these  instances  [10, 
11,  12,  13].  In  particular,  comparing  with  the  parallel  implementation  of  [10]  for  solving  the  dual  problem 
(without  recovering  primal  feasible  solutions)  we  note  that  [10]  reports  2900+  seconds  on  1  processor  and 
800+  seconds  on  32  processors  for  sslp_10_5  0_500. 


Problem 

Optimal  Value 

Iterations 

Solutions 

Time  (secs) 

1  processor 

32  processors 

sslp_5_25_50 

-121.600 

2 

16 

2.3 

0.7 

sslp_5_25_100 

-127.370 

2 

17 

6.0 

1.0 

sslp_l  0_50_50 

-364.640 

4 

123 

65.7 

6.0 

sslp_10_50_100 

-354.190 

4 

187 

153.3 

11.0 

sslp_10_50_500 

-349.136 

3 

241 

1296.0 

46.3 

sslp_10_50_1000 

-351.711 

2 

191 

2691.0 

60.7 

sslp_10_50_2000 

-347.262 

2 

226 

4952.0 

143.3 

sslp_15_45_5 

-262.400 

6 

27 

4.3 

2.0* 

sslp_15_45_10 

-260.500 

13 

116 

35.3 

7.0* 

sslp_15_45_15 

-253.600 

17 

233 

92.7 

13.3* 

Table  2:  Solutions  times  for  the  sslp  instances  (*the  number  of  processors  is  equal  to  the  number  of  sce¬ 
narios.) 


4.2  Nonlinear  stochastic  integer  programming  instances 

Our  second  set  of  instances  is  a  class  of  expected  utility  knapsack  problems  from  [1].  These  problems  are  of 
the  form 

!N  n  n  \ 

-{l/N)^2u(52,vnxi)  '■  ^2aoxj  <  h  xj  e  {0,1}"  > , 
i=i  j= i  t=i  J 

where  U(t)  =  1  —  exp (— t/c),  i.e.  a  negative  exponential  utility  function  with  risk  aversion  coefficient 
c.  We  generated  instances  with  number  of  variables  n  £  {25,  50, 100}  and  number  of  scenarios  N  £ 
{100, 500, 1000}.  The  coefficients  Vij  and  a;j  for  each  instance  were  generated  according  to  [1]  and  c  =  4.  The 
data  for  all  instances  are  available  in  the  SIPLIB  library  at  http :  / / www2  .  isye  .  gatech .  edu/  ~  sahmed/ 
siplib/. 
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Figure  1:  Speedup  for  sslp_10_50_500. 


Note  that  for  these  problem,  due  to  the  monotonicity  of  the  utility  function,  solving  the  z-th  scenario 
subproblem  in  the  lower  bounding  step  (when  A  =  0)  is  equivalent  to  solving  the  (linear)  knapsack  problem 


min 


n  n  1 

ajxi  —  1)  x:i  ^  {0)  1}  ? 

J=i  i= 1  J 


The  lower  bound  is  then  obtained  by  averaging  the  negative  of  the  utility  function  evaluated  at  the  optimal 
values  of  such  knapsack  problems  corresponding  to  each  scenario.  The  upper  bounding  step  constitute 
simply  evaluating  the  scenario  solutions. 

Table  3  presents  the  computational  results.  The  first  column  is  the  name  of  the  instance  in  the  form 
exput  il_nJV  where  n  is  the  number  of  variables  and  N  is  the  number  of  scenarios.  As  before,  columns  2- 
6  present  the  optimal  value,  the  number  of  main  iterations,  total  number  of  solutions  explored,  and  the  total 
solution  time  or  the  %  optimality  gap  in  case  the  5000  sec  time  limit  is  exceeded.  Each  reported  solution 
time  is  the  average  of  3  independent  runs  to  account  for  the  load  variability  in  the  computing  cluster.  Once 
again,  in  all  of  the  instances  an  optimal  solution  is  found  in  the  first  iteration. 


Problem 

Optimal  Value 

Iterations 

Solutions 

Time  (secs)  /  gap  (%) 

1  processor 

32  processors 

exput  il_2  5_100 

-0.242304 

2 

49 

4.7 

1.7 

exputil_25_500 

-0.246211 

2 

68 

24.0 

6.3 

exput  il_25_l  000 

-0.242750 

2 

261 

122.0 

15.7 

exputil_50_100 

-0.246343 

2 

78 

3.3 

2.3 

exputil_50_500 

-0.247751 

3 

462 

100.0 

20.7 

exput  il_5  0_10  00 

-0.247643 

3 

323 

225.3 

41.7 

exput il_l 00_100 

-0.247542 

22 

2072 

4443.0 

690.7 

exput il_100_5 00 

-0.248989 

>22 

>10096 

0.025% 

0.009% 

exput il_100_l 000 

-0.249317 

9 

5418 

0.004% 

721.0 

Table  3:  Solution  times  for  the  exput  il  instances 
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Strengthened  Bounds  for  the 
Probability  of  /c-Out-Of-n  Events 

Feng  Qiu*  Shabbir  Ahmed,  and  Santanu  S.  Dey 
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Abstract 

Given  a  set  of  n  random  events  in  a  probability  space,  represented 
by  n  Bernoulli  variables  (not  necessarily  independent,)  we  consider  the 
probability  that  at  least  k  out  of  n  events  occur.  When  partial  distribution 
information,  i.e.,  individual  probabilities  and  all  joint  probabilities  of  up 
to  m  ( m  <  n)  events,  are  provided,  only  an  upper  or  lower  bound  can  be 
computed  for  this  probability.  Recently  Prekopa  and  Gao  (Discrete  Appl. 
Math.  145  (2005)  444)  proposed  a  polynomial-size  linear  program  to 
obtain  strong  bounds  for  the  probability  of  union  of  events,  i.e.,  k  =  1.  In 
this  work,  we  propose  inequalities  that  can  be  added  to  this  linear  program 
to  strengthen  the  bounds.  We  also  show  that  with  a  slight  modification  of 
the  objective  function  this  linear  program  and  the  inequalities  can  be  used 
for  the  more  general  case  where  k  is  any  positive  integer  less  than  or  equal 
to  n.  We  use  the  strengthened  linear  program  to  compute  probability 
bounds  for  the  examples  used  by  Prekopa  and  Gao,  and  the  comparison 
shows  significant  improvement  in  the  bound  quality. 

Keywords:  linear  programming,  probability  bound,  fc-of-n  event 


1  Introduction 

Let  {Aj  :  j  £  Nj  be  a  set  of  events,  where  N  is  the  index  set  {l,...,n}. 
Define  random  variable  Xj  :  Aj  — >  {0, 1}  as  Xj  =  1  if  Aj  occurs,  and  Xj  =  0, 
otherwise.  Define  /i  as  a  random  variable  that  represents  the  number  of  events, 
AjS,  that  occur,  i.e.,  p,  =  JV  Xj.  Let  A:  be  a  positive  integer  between  1  and  n; 
then  the  probability  that  (at  least)  k  out  of  n  events  in  {Ai, An}  occur  is 
denoted  by  P(@  >  k). 

Computation  of  P(/x  >  k )  is  often  needed  in  applications.  For  example,  in  a 
maximum  availability  location  problem  [3] ,  the  probability  that  the  population 
in  a  subregion  is  covered  by  at  least  k  facilities  is  used  to  calculate  the  expected 

*Tel:+l-(608)-354-5638,  Fax:  +l-(630)-252-9868,  Email:  fqiu@gatech.edu 
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coverage.  Another  example  is  the  reliability  problem  of  communication  net¬ 
works,  where  each  arc  fails  with  a  certain  probability  and  we  want  to  compute 
or  approximate  the  node-to-node  reliability  of  the  system. 

Accurate  computation  of  P(/x  >  k )  is  not  easy.  A  complete  distribution  func¬ 
tion  for  a  system  of  Bernoulli  events  involves  an  exponential  size  of  data,  which 
is  difficult  to  handle  when  n  is  large.  Furthermore,  in  practice,  the  complete 
distribution  function  for  (Ad,  ...,Xn)  is  often  not  available,  unless  the  Bernoulli 
random  numbers  Xj  are  independent  from  each  other.  The  available  informa¬ 
tion  is  often  the  marginal  distributions  and  joint  distributions  up  to  level  to 
(m  -C  n).  In  this  situation,  it  is  desirable/preferable  to  compute  a  lower  or 
upper  bound  using  only  a  limited  amount  of  information. 

A  number  of  bounding  results  that  utilize  different  amounts  of  information 
under  different  settings  have  been  proposed.  The  classic  Boole  inequality  yields 
a  lower  bound  for  the  union  of  events  with  only  individual  marginal  probabili¬ 
ties,  i.e. ,  k  =  1  and  m  =  1.  Dawson  and  Sanko  [4]  provide  a  sharp  lower  bound 
for  the  probability  of  the  union  of  events  using  marginal  and  pairwise  joint  prob¬ 
abilities,  i.e.,  to  =  2;  Kwerel  [5,  6]  developed  bounds  that  can  utilize  one  more 
degree  of  joint  probabilities,  i.e.,  to  =  3;  Prekopa,  Boros  and  other  researchers 
[7,  8,  2]  employed  certain  linear  programs  to  derive  lower  and  upper  bounds 
for  a  general  case  where  to  can  be  any  positive  integer  less  than  or  equal  to  n. 
Most  of  these  works  use  aggregation  of  the  individual  joint  probabilities  of  the 
same  degree  in  their  formulations,  called  binomial  moments.  As  a  consequence 
of  summation,  individual  probability  information  is  lost,  and  the  given  infor¬ 
mation  is  not  fully  utilized.  To  make  better  use  of  the  available  information, 
Prekopa  and  Gao  [9]  derived  LP-based  bounds  for  a  union  of  events  using  par¬ 
tially  aggregated  information.  We  call  this  LP  model  as  the  partially  aggregated 
model  (PAM).  The  numerical  examples  showed  that  their  bounds  were  at  least 
as  strong  as  other  results  using  binomial  moments. 

In  this  paper,  we  first  observe  that  the  results  in  [9]  which  is  stated  for 
the  case  of  k  =  1,  generalizes  in  a  straightforward  way  to  the  case  where  k  is 
any  positive  integer  less  than  or  equal  to  n.  Our  key  contribution  is  to  identify 
inequalities  that  can  be  appended  to  PAM  to  strengthen  the  bounds.  We  call  the 
resulting  LP  model  as  strengthened  partially  aggregated  model  (SPAM).  These 
results  are  presented  in  Section  2.  We  test  the  strength  of  the  extra  inequalities 
using  the  instances  in  [9],  and  the  results  show  significant  improvement  of  the 
bound  quality  of  SPAM  over  PAM.  We  also  computationally  identify  a  family 
of  probability  distributions  for  which  the  bounds  provided  by  SPAM  has  the 
largest  improvement  over  the  bounds  provided  by  PAM  model.  We  report  these 
computational  results  in  Section  3. 
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2  Strengthening  the  Partially  Aggregated  Model 

2.1  The  Partially  Aggregated  Model  for  k  e  {1, . . .  ,n} 

Let  S'  be  a  subset  of  N.  Denote  the  joint  probability  that  all  events  in  the  set 
S  occur  as  ps  :=  P(fj^£s  A)-  Let  sJt  :=  J2  lsi=tPS-  We  first  state  the  main 

S:  j£S 

result  of  [9]  using  our  notation. 

Theorem  1  ([9]).  Given  the  joint  distributions  up  to  level  m,  i.e.,  |S|  <  m, 
a  lower  (or  upper)  bound  for  the  probability  of  the  union  of  these  events,  i.e., 
P(/r  ^  1);  can  be  calculated  by  solving  the  following  linear  program  (PAM): 


n  n 


min  (max) 

(la) 

*= i  j=i 

n  n 

s-t-  E  E  a?  =i 

(lb) 

i—0  j= 1 

EQ%-  =7st 

(lc) 

Vij  >  0. 

(Id) 

□ 

The  LP  model  above  and  those  in  [7,  8,  2,  9]  are  derived 
reasonings  (See  Appendix  1  for  a  brief  review.) 

using  probabilistic 

We  next  show  that  the  bounds  on  P(/x  >  k)  for  k  S  {1, 

. . . ,  n}  can  be  ob- 

tained  by  solving  (1)  by  changing  the  objective  function  (la)  to  Y^i=k  Sj=i  vij- 
As  pointed  out  in  [9] ,  any  bound  that  uses  ps  can  be  recovered  as  the  value 
of  the  objective  function  of  the  Boolean  LP  model  [1],  which  consists  of  the 
probability  of  each  possible  outcome.  Let  C  be  a  subset  of  N,  and  let  the 
probability  of  the  outcome  associated  with  C  be  denoted  as 


wc  ■=  p  h  n  Ai)  P|(  fl  A) 

\  ieC  j&N\C 

where  Aj  represents  that  event  A:j  does  not  occur.  Note  that  these  outcomes  are 
mutually  exclusive.  The  probability  of  any  event  can  be  represented  by  using 
the  probabilities  of  an  appropriate  set  of  outcomes.  For  example,  the  probability 
that  at  least  k  out  of  n  events  occur  can  be  expressed  as  Y^ccn  \c\>k  *c-  Given 
joint  distributions  up  to  level  m,  i.e.,  ps,  |Sj  <  m,  a  lower  (or  upper)  bound  of 
P (p  >  k)  can  be  obtained  by  the  following  Boolean  linear  program  [1]: 

min(max)  E  WC 

C:\C\>k 
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(2) 


S.t.  ^  Wc  =  1 
CCN 

^2  WC  =  PS,  VS  C  TV,  \S\  <  TO 
c-.scc 

wc  >  0  VC  C  TV. 


Formulation  (2)  consists  of  an  exponential  number  of  variables  and  may  not  be 
useful  in  practice.  In  order  to  obtain  the  PAM  LP  model,  we  first  duplicate 
each  row  with  right-hand  side  ps  in  (2)  |S|-1  times,  add  up  rows  with  |5|  =  t 
and  j  £  S  for  each  t  and  j ,  and  then  arrive  at 


EE7  E 

i=k  j— 1  |C|=i 


WC 


j&C 


EE 7  E  wg  =  1 


i=l  j— 1  \C\—i 

c-  j&C 


i  —  1 


EL.!]  E  WG=  E  ps  t  =  l,...,mj  =  l,...,n 

i=t  '  '  |C?|=i  „  |S|=t 

tec  jes 

wc>  0  VC  C  TV. 


(3a) 

(3b) 

(3c) 

(3d) 


Equations  (3a)-(3d)  are  the  resulting  rows  by  duplicating  and  aggregating  the 
rows  in  (2) .  Notice  that  the  following  variables  share  the  same  coefficient  in  each 
row:  wc  with  \C\  =  i  and  j  €  C.  Therefore,  we  aggregate  these  variables  into 
a  single  variable  and,  for  notational  simplicity,  scale  the  resulting  aggregrated 
variable  as 


-7  E 


wc 


=  1,  -,n  j  =  1, 


(4) 


c-  |C|=i 
j&c 


Thus  we  obtain  the  following  linear  program: 


n  n 


min 

(5a) 

i=k  j= 1 

n  n 

s-t-  E  E  vg  = 1 

(5b) 

j= 1  i=0 

E  \  =  7s*  i  =  l,-,m  j  =  l,...,n 

i~t  ^  ^ 

(5c) 

1  ^ — -v 

Vij  =  -  2_^  WG  *  =  I,  •••,  n  j  =  1,  ...n 

(5d) 

1  c.\C\=i 

L '  jec 
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v  >  0  *  =  1, n  j  =  1, n 
wc  >  0  VC  C  N. 


(5e) 

(5f) 


Now  observing  that  the  variables  wc  do  not  appear  in  the  objective  function 
(5a),  we  may  relax  (5),  by  dropping  the  variables  wc  and  the  constraints  (5d) 
and  (5f),  to  obtain  the  PAM  model. 


2.2  Strengthened  PAM 


As  discussed  in  the  previous  section,  PAM  is  obtained  by  relaxing  (5).  In  order 
to  strengthen  PAM,  we  project  out  the  variables  wc  from  the  set  defined  by  the 
constraints  (5d)  and  (5f)  to  obtain  valid  inequalities  which  we  append  to  PAM. 
Again  we  are  able  to  accomplish  this  using  the  fact  that  the  wc  variables  do 
not  appear  in  the  objective  function. 

We  need  the  following  lemma. 

Lemma  1.  Let  Sl  :=  {x  G  {0, 1}"  |  Y^'j=ixi  =  *}■  F°r  1  <  *  <  «  —  1, 
dim(5*)  =  n. 


Proof.  We  list  n  vectors  from  Sl  that  are  linearly  independent.  Set  vectors 
id, . . . ,  vl+1  as 


1  k  G  {1,..  .,*  +  l}\{j} 
0  otherwise. 


Set  vectors  vl+2 , . . . ,  vn  as 


v 


3 

k 


l  fee  {l, —  1} u {i} 

0  otherwise. 


(6) 

(7) 


It  is  straighforward  to  check  that  u1, . . . ,  vn  belong  to  Sl  and  are  linearly  inde¬ 
pendent.  □ 


Proposition  2.  Let 

Wi  =  {(...,  wc,  ■■■,  Vij, ...)  £  x  Rn  :  Vij  =  j  ^  wc  j  =  1,  -n}.  (8) 

*  c.\c\=i 
J&c 

Then  the  projections  ofWiS  onto  v  space  are  the  following  sets: 


Projv(Wf)  =  {(va, ...,  vin)  e  K”  |  -  (*  -  1  )vij  +  ^  vit  >  0,  >  0  j  £  N} 

i  =  2, ...,  n  —  2,  (9) 

Projv(Wn- 1)  =  {(u(n_i)i,  ...,u(n_i)n)  G  R”  |  -(n-2)u(n_i)i+^U(„_i)t  >  0,  j  G  N} 

(10) 


5 


DISTRIBUTION  A:  Distribution  approved  for  public  release. 


and 


Projv(Wn)  =  {(vnl,  ...,vnn)  G  R”  |  vnl  =  vnt  2  <  t  <  n,  vnl  >  0}.  (11) 


Proof.  Note  that  Proff  W,)  is  a  cone.  Therefore,  for  simplicity  of  analysis,  we 
may  scale  the  v  variables  and  thus  without  loss  of  generality  assume  that 

Wi  :=  {(..., eK^  x  R"  :vij  =  ^  wcj  =  l,-.n}.  (12) 


We  show  that  (9)  holds  for  an  arbitrary  i  between  2  and  n  —  1,  where 
\C\  =  i.  To  simplify  notations,  for  now,  we  drop  subscript  i  in  the  constraint. 

Let  w  =  (...,%,...)  G  R+^  and  v  =  (...,  Vjj, ...)  G  Rn.  We  then  rewrite  (12)  as 
follows 

Wi  =  {(m,  v)  G  rP  xRn  :v  =  Gw}, 

where  G  =  G  Knx(i)  is  the  coefficient  matrix  and  gc  =  ( gc ,  g3c, ...,  gc)T 

is  the  column  corresponding  to  the  variable  wc-  ffc  =  1  if  j  G  G\  g°c  =  0  if 
j  C .  G  consists  of  all  permutations  of  the  0-1  vector  with  i  ones  and  n  —  i 
zeros.  Rewrite  (9)  as  follows: 

Vi  :=  {!)Gl":(e  -  iej)T  v  >0;  eJv  >  0  j  =  1, ...,  n}, 

where  e  G  R"  is  a  vector  with  all  components  equal  to  one  and  ej  G  R"  is  the 
j-th  unit  vector. 

To  show  that  Vi  is  the  projection  of  W,  into  u-space,  we  need  to  show 

v  G  Vi  O  There  is  a  w  £  R^  such  that  Gw  =  v. 

By  Farkas’  lemma,  we  have  that 

There  is  a  w  G  R^  such  that  Gw  =  v  u  G  Rn  such  that  uT G  >  0  =>•  uTv  >  0. 


Let  {ue}  be  the  set  of  extreme  rays  of  the  cone{rt  :  uTG  >0}.  It  is  sufficient 
to  show  that  the  set  of  constraint  vectors  in  V),  i.e.,  (e  —  iej)  and  ej,  is  exactly 
{ue}- 

We  first  show  (e  —  iej)  and  ej  are  extreme  rays. 

(e  —  iej)T  is  a  feasible  solution  for  the  cone  {u  :  uTG  >  0}: 


{e-iej)Tgc 


0  C  :  j  £  C 
i  -  1  C  :j  f  C. 


6 


DISTRIBUTION  A:  Distribution  approved  for  public  release. 


Furthermore,  the  products  above  have  (”_1)  zeros,  which  means  that  ("_i) 
constraints  in  uTG  >  0  are  tight.  Notice  that  the  binding  constraint  vectors  gc 
are  all  the  permutations  of  vectors  with  the  j-th  position  fixed  to  one.  Therefore, 
by  Lemma  1,  they  span  R"_1  and  we  can  find  n  —  1  linearly  independent  vectors 
among  them  which  have  zero  products  with  (e  —  iej).  Thus,  (e  —  iej)  is  an 
extreme  ray  of  {u  :  uTG  >  0}. 

As  for  ej ,  we  have  ej  G  as  follows 


eJi 


C-.jeC 

C'.jic. 


Therefore,  e,j  is  a  feasible  solution  for  the  cone  {u  :  uT G  >  0}  and  the  product 
above  has  (rl~1)  zeros  that  correspond  to  gc  with  j  £  C.  Among  those  columns 
that  have  zero  products  with  ej ,  by  using  Lemma  1  we  can  find  n  —  1  linearly 
independent  columns.  Therefore,  ej  is  an  extreme  ray  of  {u  :  uT G  >  0}. 

Now  we  show  by  contradiction  that  there  are  no  other  extreme  rays.  Let  A 
0  be  a  new  distinct  extreme  ray;  and  let  gse  =  ( gge , ...,  g*s  ,  )T  £  =  1,  n—1 

be  the  set  of  linear  independent  columns  of  G  with  A T gse  =0^=1,  ...,n  —  1. 
Since  A  ej  for  all  j  =  1, ...,  n,  we  have 

$t  such  that  gtSi  =0  V£  =  1, ...,  n—1  (13) 

since  otherwise,  e*  would  be  the  extreme  ray  formed  by  the  half  planes  {uT gge  > 
0  £  =  1, ..., n—1}.  Similarly,  since  A  /  (e-  iej )  for  all  j  =  1, ..., n,  we  have 

$t  such  that  gtS(  =  1  W  =  1, ...,  n  —  1.  (14) 

Using  (13)  and  gse  >  0,  we  obtain  that  At.  <  0  for  some  t*  (Otherwise,  A T gse  > 
0  for  some  i  by  (13)).  By  (14),  there  is  a  £*  such  that  g ^  =  0.  Since  A T gse»  = 
^ tdSe *  =  and  9st»  7^  Oj  we  can  find  an  index  t  such  that  Aj  >  0 
and  gg^  =  1.  Let  g*  be  a  vector  obtained  by  switching  the  components  at 
position  t*  and  t  in  vector  gge,  ■  Note  that  g*  is  also  a  vector  of  G.  Then 
A T g*  =  A Tgse,  +  At*  —  Aj  <  0.  Thus,  A  is  not  a  feasible  ray  of  the  cone 
{u  :  uTG  >  0}.  Therefore,  there  are  no  extreme  rays  of  {u  :  uTG  >  0}  other 
than  {(e  —  iej),  j  =  1, ...,  n}  and  Proj(Wi)  =  Vi  for  i  =  2, ...,  n  —  2. 

For  Proj(Wn_i),  we  can  show  that  {(e  —  iej)  j  =  1,  ...,n}  are  extreme  rays 
of  {u  :  uTG  >  0}.  However  {ej}  are  not  extreme  rays  in  this  case  since  each  row 
of  G  has  only  one  zero.  Using  a  similar  argument  as  that  in  (2),  we  can  show 
that  {(e  —  iej)  j  =  1, ...,  n}  are  the  only  extreme  rays  and  that  Proj(H/ra)  =  Vn. 

The  equations  in  (10)  hold  since  vnj  =  Vn  for  all  j  =  1,  ...n.  □ 


Notice  that  the  projection  of  W\  yields  only  non-negativity  constraints  on 
Vij,  j  =  1,  ...,n  and  the  non- negativity  of  v^n_ij  j  for  j  =  1,  ...,n  is  implied  by 
-(*  -  l)^(n— 1  )j  +  E tjtj  yit  >  0  j  =  1,  n. 

The  inequalities  in  Proposition  2, 

~{i  -  1  )vjj  +  y  yVit>  0,  Vij  >  0  j  =  i  =  2,...,n-  1  (15) 

t¥=J 
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and 


Vni  =  Vnt  2  <  t  <  n, 


(16) 


are  valid  for  model  (5)  and  they  are  not  implied  by  the  constraints  in  PAM.  We 
call  the  linear  program  with  these  additional  constraints  as  the  Strengthened 
Partially  Aggregated  Model  or  SPAM. 


3  Numerical  Examples 

In  the  following,  we  calculate  lower  bounds  with  the  strengthened  model  for 
the  examples  in  [9],  and  compare  the  new  bounds  with  those  presented  in  [9]. 
Note  that  the  lower  bounds  in  [9]  are  only  for  the  probability  of  the  union  of 
events,  i.e.,  k  =  1.  Examples  1,  2,  and  3  have  20  Bernoulli  variables  each,  i.e. , 
Xi, ...,  X20 •  All  the  outcomes  and  their  probabilities  are  presented  in  Table  1,  2, 
and  3,  respectively.  With  these  tables,  we  can  obtain  the  marginal  probabilities, 
pair-wise  joint  probabilities,  and  liigher-order  joint  probabilities  by  adding  up 
appropriate  rows. 


Table  1:  Probability  Distributions  in  Example  1 


Outcome 


Bernoulli  Random  Variables 

Xi  X2  X3  X4  X5  Xq  X7  X8  Xq  X10  Xu  X12  X13  X14  X15  X16  X17  X18  X19  X20 


Probability 


1 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

1 

0.012214 

2 

0 

1 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0.022231 

3 

1 

0 

1 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0.023287 

4 

0 

1 

0 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

0.033976 

5 

1 

0 

1 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

1 

0 

0.034761 

6 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 

1 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0.044582 

7 

1 

0 

0 

1 

0 

1 

0 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0.045943 

8 

0 

1 

0 

1 

0 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

1 

0 

1 

0 

1 

0.055185 

9 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

0 

0 

1 

0 

1 

0.056404 

10 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0.066317 

11 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0.067685 

12 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0.077376 

13 

1 

0 

0 

1 

0 

1 

0 

0 

0 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0 

1 

0.078648 

14 

0 

1 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0.088878 

15 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0.292514 

We  compare  the  lower  bounds  corresponding  to  k  =  1  produced  by  SPAM 
with  the  results  in  [9]  in  Table  4.  Note  that  all  of  the  bounds  in  Table  4  are 
calculated  with  only  marginal  probabilities  and  pair-wise  joint  probabilities, 
except  those  in  the  fourth  column. 

The  second  column  cites  the  results  obtained  by  the  formula  derived  in  [4], 
which  can  be  obtained  by  further  aggregating  the  Boolean  LP  model.  We  call 
this  the  Fully  Aggregated  Model  or  FAM  (See  (18)  in  Appendix  1).  The  third 
column  cites  the  results  obtained  by  PAM  derived  in  [9].  The  fourth  column 
cites  the  results  obtained  by  PAM  but  with  three  binomial  moments,  including 
the  triple- wise  joint  probabilities.  Prekopa  and  Gao  also  developed  heuristics  in 
[9]  to  strengthen  the  lower  bounds  by  PAM,  but  the  best  results  obtained  are 
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Table  2:  Probability  Distributions  in  Example  2 


Outcome 

Bernoulli  Random  Variables 

X1  X2  X3  X4  X5  Xq  X7  X8  Xq  X10  Xxl  X12  X13  X14  X15  X16  X17  X18  X19  X20 

Probability 

i 

0 

1 

0 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0.00896463 

2 

1 

1 

0 

1 

0 

0 

0 

0 

0 

1 

0 

1 

0 

1 

1 

0 

0 

1 

0 

1 

0.02492217 

3 

1 

0 

1 

1 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0.02109813 

4 

0 

1 

1 

0 

0 

1 

1 

0 

1 

0 

0 

1 

1 

0 

0 

0 

0 

1 

0 

0 

0.03779353 

5 

0 

1 

1 

1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

1 

1 

0 

0.04632610 

6 

1 

0 

0 

0 

0 

0 

1 

0 

1 

1 

1 

1 

1 

0 

1 

0 

0 

0 

0 

1 

0.04284324 

7 

1 

1 

0 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0.07804262 

8 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0.02536991 

9 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

0 

1 

0.01916672 

10 

1 

1 

0 

1 

0 

0 

0 

1 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0.06340085 

11 

0 

1 

0 

1 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

1 

1 

0 

0 

0 

1 

0.07315289 

12 

1 

0 

0 

1 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0 

1 

0.07732742 

13 

0 

1 

0 

0 

1 

1 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0.02248020 

14 

0 

1 

1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0.09164494 

15 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0.36745660 

Table  3:  Probability  Distributions  in  Example  3 


Outcome 

Bernoulli  Random  Variables 

Vi  V2  V3  X4  X5  Xq  X7  X8  X9  X10  Xu  X12  X13  X14  X15  X16  X17  X18  X19  X20 

Probability 

1 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0.10176880 

2 

1 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0.11299200 

3 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0.01514044 

4 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0.05684733 

5 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0.03270125 

6 

1 

0 

0 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0 

0 

1 

0 

1 

0 

0 

1 

0.10050750 

7 

0 

1 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0.07306695 

8 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0.01743922 

9 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0.06284498 

10 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0.05830101 

11 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

0.06833096 

12 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

1 

0 

1 

0 

0 

1 

0.07153743 

13 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0.04503293 

14 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

0 

0.03487869 

15 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0.14860240 

Table  4:  Comparison  with  results  in  [9]  for  k  =  1 
Example  FAM  PAM  PAM(3)  SPAM 

4  0.8275266  0.8580833  0.8864460  0.9394167 

5  0.8658182  0.9100646  0.9354100  0.9482229 

6  0.8985498  0.9435812  0.9587778  0.9715460 
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no  better  than  those  obtained  by  involving  triple-wise  probabilities,  which  are 
listed  in  the  fourth  column.  Therefore,  we  ignore  the  results  of  the  heuristics. 
The  fifth  column  gives  the  results  obtained  by  SPAM.  We  observe  in  Table  4 
that  the  extra  inequalities  derived  in  this  study  significantly  improve  the  lower 
bounds  calculated  by  PAM. 

Now,  we  use  the  same  instances  to  calculate  the  lower  bounds  for  the  prob¬ 
ability  of  3-coverage,  i.e. ,  k  =  3,  and  summarize  the  results  in  Table  5,  which 


Table  5:  Lower  Bounds  for  3-Coverage  ( k  =  3) 


Example 

FAM 

PAM 

PAM(3) 

SPAM 

i 

0.6643058 

0.665249 

0.6768353 

0.6745504 

2 

0.7298830 

0.7528989 

0.7847514 

0.8005835 

3 

0.7387907 

0.7819380 

0.8482093 

0.8614323 

again  shows  that  SPAM  produces  significantly  tighter  lower  bounds  than  both 
FAM  and  PAM.  We  also  calculate  the  upper  bounds  and  compare  the  results 
with  the  results  in  [9].  However,  we  do  not  observe  improvement  in  the  new 
upper  bounds. 

In  Table  6  we  provide  examples  to  show  that  for  certain  probability  dis¬ 
tributions,  the  improvement  by  the  inequalities  in  Proposition  2  can  be  much 
more  significant  than  the  improvement  observed  in  Examples  1,  2,  and  3.  The 
specific  distrobution  under  which  the  lower  bounds  in  Table  6  are  calculated 
are  obtained  computationally  by  an  optimization  model  provided  in  Appendix 
2.  We  run  this  optimization  model  with  n  fixed  at  10,  m  fixed  at  3,  and  k 
varying  from  2  to  4.  The  optimization  problems  are  solved  by  the  nonlinear 
solver  BARON  and  the  optimal  solutions  produce  probability  distributions  un¬ 
der  which  the  gaps  between  PAM  and  SPAM  are  maximal.  In  our  experiments 
we  enforce  a  solution  time  limit  of  ten  hours.  Columns  3  and  4  of  Table  6  list 
the  lower  bounds  yield  by  PAM  and  SPAM,  respectively.  For  comparison  we 
also  calculate  the  lower  bounds  using  FAM  and  the  Boolean  model  (BM)  and 
list  them  in  Columns  2  and  5,  respectively. 


Table  6:  Comparison  of  Lower  Bounds 


k 

FAM 

PAM 

SPAM 

BM 

2 

0.6296296 

0.6296296 

1.0000000 

1.0000000 

3 

0.3412896 

0.3432099 

0.8065844 

0.8065844 

4 

0.2108243 

0.2143992 

0.8599802 

0.8599802 
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Appendix  1 

We  briefly  motivate  the  partially  aggregated  model  introduced  in  [9].  Let  (/J) 
represent  the  combination  of  choosing  i  out  v.  We  observe  that  [7] 

(?)=  e  n*-  w 

V  7  SCN:\S\=ieeS 

Notice  that  both  sides  of  above  equation  are  random  numbers.  After  taking 
expectation,  we  have  the  following  equation 

E(-)^=  E  Ps>  (18) 

t=l  V  7  SCN:\S\=i 

where  ps  =  ^Plces7^)  and  the  right-hand  side  is  the  7-th  binomial  moment. 
Note  that  (18)  are  the  constraints  of  the  LP  model  in  [7,  8,  2],  which  we  call 
Fully  Aggregated  Model  or  FAM. 

A  simple  consequence  of  (17)  is  as  follows  [9] 
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Taking  expectation  of  both  sides  in  above  equation,  we  have  the  following 

£(•_  i)^'=  E  PS’  (2°) 

t= 1  V  7  SCJV:|S|=i,je5 

where  =  P(/z  =  t  fl  Xj  =  1).  Note  that  the  right-hand  side  of  (20)  only 
consists  of  the  probabilities  of  those  events  that  A:)  occurs.  Since  the  probability 
of  the  union  of  . . . ,  An  can  be  expressed  as  follows 

Theorem  3  ([9]). 

n  n  1 

p(m>  i)  =  EETWiA  (21) 

3=1  i=l 

an  upper  bound  or  a  lower  bound  on  the  probability  of  union  of  events  can  be 
obtained  by  minimizing  or  maximizing  the  right-hand  side  of  (21)  over  the  set 
of  constraints  in  (20)  and  the  non-negativity  constraints  [9]. 

Appendix  2 

The  following  optimization  model  identifies  the  probability  distributions  for 
which  the  gap  between  SPAM  and  PAM  is  maximal.  Let  z*SPAM  and  z*PAM 
be  the  lower  bounds  for  the  probability  that  at  least  fc-out-of-n  events  occur, 
obtained  by  SPAM  model  and  PAM  model,  respectively.  Let  variables  ps,  for 
all  S  C  TVand  (S)  <  m,  represent  the  probability  distributions  we  aim  to  obtain. 
Note  that  sj  is  now  a  function  of  ps .  Let  no  and  irj  be  the  dual  variables  for 
constraint  (5b)  and  (5c),  respectively;  p]  be  the  dual  variable  corresponding 
to  the  constraint  (15)  for  i  =  2,  ..,n  —  1  and  j  =  1  ,...,n,  and  pJn  be  the  dual 
variable  corresponding  to  the  constraint  (16)  for  j  =  2,  ...,n. 


gap  —  max  (. z*SPAM  —  zPAM ) 


PS 


=  max  (minjV^  vu  :  (5b),  (5c),  (5e),  (15),  and  (16)} 

no  iua  J  J 


PS  Vij 


j— 1  i—k 
n 

—  min{V^  Vi  :  (5b),  (5c),  and  (5e)}) 

n.-  • 


i—k 

n  m  ^ 

max  (max{7To  +  >  >  - 

n0  +  n{  <  ei  j  =  1, ...,  n 


7  7 

Ks  i  '■ 


no 


i=i 


+  E  (■  H  ~  ^  ~  +  E^«  Sett  =  2, ...,  n  —  1;  j  =  l,...,t 


l±3 


no 


+  E  "H-Ea4< 


1=2 
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i—l 


^0+^2  [  +  Mn  <  en  j  =  2,... 


tto,  ^l,^n  free,  ,  ill  >  0  i  =  2, ...,  n  -  1} 


+  max{X  ~VH  :  X  V-‘J  =  1 :  X 

i=fe  j— 1  j,i  i=t 

v{  >  0}) 


Vij  =  -s{{x)  t 


=  max 

ps,  7r,/t,ir 


'  XX  -  XX! 


i=i *=i 


j=l 


ttq  +  K  <  ei  j  =  1,  —,n 


i= 1 


tto  +  X  -  H  - -  1)/X5  +  X^«  <ett  =  2 1 


-o+x  :vxx^< 


j=2 


TTO  +X!  f”)7^  +Mn  <  en  J  =  2,  ...,  n 


X%  =  !;X 


t;,--,-  =  -5: 


'(z)  t  =  l,...,m  j  =  1,..., 


tto ,7Ti,Mn  free,  >  0  i  =  2 ,...,n-  1,  v\  >  0}, 

where  et  =  0  if  f  <  A:;  et  =  1,  otherwise. 

The  feasible  set  for  (...,ps, ...)  is  as  follows 


=  {(..., ps, ...)  G  m1")+-+(-}  :  3rt  G  Rf  s.t.  ^  wc  <  l, ps 

CCN 
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=  1  ,-,m  j  =  1, n, 


;  j  =  l,-,n 


n, 


wc  C  TV,  |S|  <  to}. 

c-.scc 
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Abstract 

In  this  work,  we  introduce  and  study  the  forbidden-vertices  problem.  Given  a  polytope  P  and  a 
subset  X  of  its  vertices,  we  study  the  complexity  of  linear  optimization  over  the  subset  of  vertices  of 
P  that  are  not  contained  in  X.  This  problem  is  closely  related  to  finding  the  fc-best  basic  solutions 
to  a  linear  problem.  We  show  that  the  complexity  of  the  problem  changes  significantly  depending 
on  the  encoding  of  both  P  and  A'.  We  provide  additional  tractability  results  and  extended  formula¬ 
tions  when  P  has  binary  vertices  only.  Some  applications  and  extensions  to  integral  polytopes  are 
discussed. 


1  Introduction 


Given  a  nonempty  rational  polytope  P  C  Mn ,  we  denote  by  vert(P),  faces(P),  and  facets(P)  the  sets  of 
vertices,  faces,  and  facets  of  P,  respectively,  and  we  write  /(P)  :=  |facets(P)  |.  We  also  denote  by  xc(P) 
the  extension  complexity  of  P,  that  is,  the  minimum  number  of  inequalities  in  any  linear  extended 
formulation  of  P,  i.e.,  a  description  of  a  polyhedron  whose  image  under  a  linear  map  is  P  (see  for 
instance  £§).)  Finally,  given  a  set  A'  C  vert(P),  we  define  forbf  P.  X)  :=  conv(vert(P)  \  X),  where 
conv(.S’)  denotes  the  convex  hull  of  S  C  1".  This  work  is  devoted  to  understanding  the  complexity  of 
the  forbidden-vertices  problem  defined  below. 

Definition  1.  Given  a  poly  tope  P  C  R",  a  set  X  C  vert(P),  and  a  vector  c  £  Rn,  the  forbidden-vertices 
problem  is  to  either  assert  vert (P)  \  X  =  0,  or  to  return  a  minimizer  ofcTx  over  vert (P)  \  X  otherwise. 


Our  work  is  motivated  by  enumerative  schemes  for  stochastic  integer  programs  f9[,  where  a  series  of 
potential  solutions  are  evaluated  and  discarded  from  the  search  space.  As  we  will  see  later,  the  problem 
is  also  related  to  finding  different  basic  solutions  to  a  linear  program. 

To  address  the  complexity  of  the  forbidden-vertices  problem,  it  is  crucial  to  distinguish  between  differ¬ 
ent  encodings  of  a  polytope. 

Definition  2.  An  explicit  description  of  a  polytope  P  C  K”  is  a  system  Ax  <  b  defining  P.  An  implicit 
description  of  P  is  a  separation  oracle  which,  given  a  rational  vector  x  £  R",  either  asserts  x  £  P,  or  returns  a 
valid  inequality  for  P  that  is  violated  by  x. 

*  Georgia  Institute  of  Technology  and  Pontificia  Universidad  Catolica  de  Chile:  gangulo@gatech.edu 
t Georgia  Institute  of  Technology:  sahmed@isye.gatech.edu,  santanu.dey@isye.gatech.edu 
^Otto-von-Guericke-Universitat  Magdeburg:  kaibel@ovgu.de 
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Note  that  an  extended  formulation  for  P  is  a  particular  case  of  an  implicit  description.  When  P  admits  a 
separation  oracle  that  runs  in  time  bounded  polynomially  in  the  facet  complexity  of  P  and  the  encoding 
size  of  the  point  to  separate,  we  say  that  P  is  tractable.  We  refer  the  reader  to  fl9l  Section  14]  for  a 
deeper  treatment  of  the  complexity  of  linear  programming. 

We  also  distinguish  different  encodings  of  a  set  of  vertices. 

Definition  3.  An  explicit  description  of  X  C  vert(P)  is  the  list  of  the  elements  in  X.  If  X  =  vert  (F)for  some 
face  F  of  P,  then  an  implicit  description  of  X  is  an  encoding  of  P  and  some  valid  inequality  for  P  defining  F. 

Below  we  summarize  our  main  contributions. 


•  In  Section|2l  we  show  that  the  complexity  of  optimizing  over  vert  (P)\X  or  describing  forb(P,  X) 
changes  significantly  depending  on  the  encoding  of  P  and/or  X.  In  most  situations,  however, 
the  problem  is  hard. 

•  In  Section  [3]  we  consider  the  case  of  removing  a  list  X  of  binary  vectors  from  a  0-1  polytope  P. 
When  P  is  the  unit  cube,  we  present  two  compact  extended  formulations  describing  forb(  [0, 1]”,  A'). 
We  further  extend  this  result  and  show  that  the  forbidden-vertices  problem  is  polynomially  solv¬ 
able  for  tractable  0-1  polytopes. 

•  Then  in  Section[4]we  apply  our  results  to  the  //-best  problem  and  to  binary  all-different  polytopes, 
showing  the  tractability  of  both.  Finally,  in  Section  [5]  we  also  provide  extensions  to  integral 
polytopes. 

The  complexity  results  of  Sections[2]and[3]lead  to  the  classification  shown  in  TableCD  depending  on  the 
encoding  of  P  and  X,  and  whether  P  has  0-1  vertices  only  or  not.  Note  that  (*)  is  implied,  for  instance, 
by  Theorem [181  Although  we  were  not  able  to  establish  the  complexity  of  (**),  Proposition[T9lpresents 
a  tractable  subclass. 


P 

General 

0-1 

Explicit 

Implicit 

Explicit 

Implicit 

Explicit 

X 

ATP-hard  (Thm.llH 

Polynomial  for  fixed  X\  (Prop. [6] 

AfP-hard  for  \X\  =  1  (Thm.g) 

Polynomial 

Polynomial  (Thm.[l6l 

Implicit 

WP-hard  fProp.lTol 

WP-hard  (*) 

(**) 

AfP-hard  (Thm.llsl 

Table  1:  Complexity  classification. 


In  constructing  linear  extended  formulations,  disjunctive  programming  emerges  as  a  practical  power¬ 
ful  tool.  The  lemma  below  follows  directly  from  0  and  the  definition  of  extension  complexity.  We  will 
frequently  refer  to  it. 

Lemma  4.  Let  P\,...,Pk  he  nonempty  polytopes  in  R”.  If  Pi  =  {x  G  R"|  3yt  G  Rmi  :  EiX  +  Ftyi  = 
hi,  yi  >  0},  then  conv(uf=1P;)  =  {i6l"|  3xi  G  R”,  yi  G  Rmy  A  G  Rfc  :  x  =  Jfi=i  xu  Eixi  +  FiVi  = 
A ihi ,  E-=i  Ai  =  1,  r/i  >  0,  A  >  0}.  In  particular,  we  have  xc  (conv(ujL1Pi))  <  J2i=i(xc(pi)  +  !)• 


2  General  polytopes 


We  begin  with  some  general  results  when  P  C  R"  is  an  arbitrary  polytope.  The  first  question  is  how 
complicated  forb(P,  X)  is  with  respect  to  P. 
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Proposition  5.  For  each  n,  there  exists  a  polytope  Pn  C  R"  and  a  vertex  vn  G  vert(P„)  such  that  Pn  has  2n  + 1 
vertices  and  n2  +  1  facets,  while  forb(P„,  {wn})  has  2”  facets. 


Proof.  Let  Qn  :=  [0,  l]n  fl  L,  where  L  :=  {x  G  R”|  lTx  <  |}  and  1  is  the  vector  of  ones.  It  has  been 
observed  fl]  that  Qn  has  2n  + 1  facets  and  n2  +  1  vertices.  We  translate  Q„  and  define  Q'n  :=  Qn  —  —1  = 
[—  p  1  —  i]"  fl  L' ,  where  L'  :=  {x  G  R"|  lTx  <  ^}.  Since  Q'n  is  a  full-dimensional  polytope  having 
the  origin  in  its  interior,  there  is  a  one-to-one  correspondence  between  the  facets  of  Q'n  and  the  vertices 
of  its  polar  Pn  :=  ((}'.„)*  and  vice  versa.  In  particular,  Pn  has  n2  +  1  facets  and  2n  +  1  vertices.  Let 
v  G  vertf  P„  )  be  the  vertex  associated  with  the  facet  of  Q'n  defined  by  L' .  From  polarity,  we  have 
forb(Pn,  {?;})*  =  [— 1  —  i] ".  Thus  forb(P„,  {c})*  is  a  full-dimensional  polytope  with  the  origin  in 
its  interior  and  2n  vertices.  By  polarity,  we  obtain  that  forb(P„,  {-(;})  has  2"  facets.  □ 


Note  that  the  above  result  only  states  that  forb(P,  X)  may  need  exponentially  many  inequalities  to  be 
described,  which  does  not  constitute  a  proof  of  hardness.  Such  a  result  is  provided  by  Theorem ITT1  at 
the  end  of  this  section.  We  first  show  that  forb(P,  X)  has  an  extended  formulation  of  polynomial  size 
in  /(P)  when  both  P  and  X  are  given  explicitly  and  the  cardinality  of  X  is  fixed. 

Proposition  6.  Suppose  P  =  {x  G  R”|  Ax  <  b}.  Using  this  description  of  P,  and  an  explicit  list  of  vertices 
X,  we  can  construct  an  extended  fornndation  o/forb(P,  X)  that  requires  at  most  f(Pfx‘+1  inequalities,  i.e., 
xc(forb(P,X))  < /(P)lxl+1. 


Proof.  LetX  =  {ui, . . .  ,^|a'|}  and  define Tx  :=  {PiH-  •  -nP|x||  Fz  G  facets(P),  u*  f  Fz,  i  =  1, . . . ,  |X|}. 
We  claim 

forb(P,  X)  =  conv  (Uf <sfxF)  ■ 

Indeed,  let  w  G  vert(P)  \  X.  For  each  %  =  1 .... .  \X\,  there  exists  F,  G  facets(P)  such  that  w  G  F,  and 
Vi  f  Fi.  Therefore,  letting  F  :=  F\  n  •  ■  ■  n  F\x\>  wc  have  F  G  Tx  and  w  G  F,  proving  the  forward 
inclusion.  For  the  reverse  inclusion,  consider  F  G  Tx-  By  definition,  F  is  a  face  of  P  that  does  not 
intersect  X,  and  hence  F  C  forb(P,  X). 

By  Lemma|H  we  have  xc(forb(P,  X))  <  (xc(^)  +  1).  Since  xc(P)  <  f(F)  <  f(P)  —  1  for  each 

proper  face  F  of  P  and  | Tx  \  <  ./'( /J) ' x  I,  the  result  follows.  □ 

Note  that  when  X  =  {v},  the  above  result  reduces  forb(P,  {(;})  to  the  convex  hull  of  the  union  of  the 
facets  of  P  that  are  not  incident  to  v,  which  is  a  more  intuitive  result.  Actually,  we  can  expect  describing 
forb(P,  X)  to  be  easier  when  the  vertices  in  X  are  "far"  thus  can  be  removed  "independently",  and 
more  complicated  when  they  are  "close".  Proposition[6]can  be  refined  as  follows. 

The  graph  of  a  polytope  P,  or  the  1-skeleton  of  P,  is  a  graph  G  with  vertex  set  vert(P)  such  that  two 
vertices  are  adjacent  in  G  if  and  only  if  they  are  adjacent  in  P. 

Proposition  7.  Let  G  be  the  graph  of  P.  Let  X  C  vert(P)  and  let  (Xl,  . . . ,  Xrn)  be  a  partition  of  X  such  that 
X,  and  Xj  are  independent  in  G,  i.e.,  there  is  no  edge  connecting  X,  to  Xj,for  all  1  <  i  <  j  <  m.  Then 

m 

forb(P,  X)  =  P|  forb(  P.  X,  ). 

i=l 


Proof.  We  only  need  to  show  forb(P,  X)  D  fjj=i  forb(P,  Xf).  For  this,  it  is  enough  to  show  that  for  each 
c  we  have  max{cTa;  :  x  G  forb(P,  X)}  >  max  {cT£  :  x  G  n!=i  forb(P,  Xj)} .  Given  c,  let  v  be  an  optimal 
solution  to  the  maximization  problem  in  the  right-hand  side,  and  let  W  C  vert(P)  be  the  set  of  vertices 
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w  of  P  such  that  cTw  >  c 1  v.  Observe  that  W  induces  a  connected  subgraph  of  the  graph  G  of  P  since 
the  simplex  method  applied  to  max{cTx  :  x  £  P}  starting  from  a  vertex  in  W  visits  elements  in  It  '  only. 
Hence,  due  to  the  independence  of  Ai, . . . ,  Xm,  either  there  is  some  w  £  W  with  w  ^  X\  U  •  •  •  U  Xm,  in 
which  case  we  have  w  £  forb(P,  A')  and  c 1  w  >  c T  v  as  desired,  or  W  C  A,  for  some  i,  which  yields  the 
contradiction  v  £  forb (P,  X *)  C  forb(P,  W)  with  cTx  <  cTv  for  all  x  £  vert (P)  \  W.  □ 


Conversely,  we  may  be  tempted  to  argue  that  if  forb(P,  X)  =  forb(P,  X\)  (T  forb(P,  A2),  then  X\  and 
A’ 2  are  "far".  However,  this  is  not  true  in  general.  For  instance,  consider  P  being  a  simplex.  Then 
any  X  C  vert(P)  is  a  clique  in  the  graph  of  P,  and  yet  forb(P,  A')  =  forb(P,  Ai)  n  forb(P,  X%)  for  any 
partition  (Ai,  A'2)  of  A. 

Proposition  [7]  generalizes  the  main  result  of  [12]  regarding  cropped  cubes.  Moreover,  the  definition 
of  being  "croppable"  in  [f2j]  in  the  case  of  the  unit  cube  coincides  with  the  independence  property  of 
Proposition  [7] 

Recall  that  a  vertex  of  an  n-dimensional  polytope  is  simple  if  it  is  contained  in  exactly  n  facets.  Propo¬ 
sition  [7]  also  implies  the  following  well-known  fact. 

Corollary  8.  If  X  is  independent  in  the  graph  of  P  and  all  its  elements  are  simple,  then 

forb(P, A)  =  PH  Q  Hv, 

vex 


where  Hv  is  the  half-space  defined  by  the  n  neighbors  ofv  that  does  not  contain  v. 


Proof.  The  result  follows  from  Proposition  0  since,  as  A  is  simple,  we  have  forb(P,  {(;})  =  Pfl  Hv  for 
any  v  £  X.  □ 


Observe  that  when  P  is  given  by  an  extended  formulation  or  a  separation  oracle,  /(P)  may  be  ex¬ 
ponentially  large  with  respect  to  the  size  of  the  encoding,  and  the  bound  given  in  Proposition  [6]  is 
not  interesting.  In  fact,  in  this  setting  and  using  recent  results  on  the  extension  complexity  of  the  cut 
polytope  0,  we  show  that  removing  a  single  vertex  can  render  an  easy  problem  hard. 

Let  Kn  =  ( Vn,En )  denote  the  complete  graph  on  n  nodes.  We  denote  by  CUT(n),  CUT°(n),  and 
sf-CUT(n)  the  convex  hull  of  the  characteristic  vectors  of  cuts,  nonempty  cuts,  and  st-cuts  of  Kn,  re¬ 
spectively. 

Theorem  9.  For  each  n,  there  exists  a  set  Sn  C  rH™-1)/2  with  [gn|  =  2n~1  +  n  —  1  and  a  point  vn  £  Sn  such 
that  linear  optimization  over  Sn  can  be  done  in  polynomial  time  and  xc(conv(S'n))  is  polynomially  bounded,  but 
linear  optimization  over  Sn  \  {Tv,}  is  AfV-hard  and  xc(con v(Sn  \  {?;„}))  grows  exponentially. 


Proof.  Let  Tn  :=  {n2le|  e  £  En},  where  le  is  the  e-th  unit  vector,  and  define  Sn  :=  vert  (CUT0 (n))  UT„. 

We  have  that  linear  optimization  over  Sn  can  be  done  in  polynomial  time.  To  see  this,  suppose 
we  are  minimizing  cTx  over  Sn.  Let  xT  and  xc  be  the  best  solution  in  Tn  and  CUT°(n),  respec¬ 
tively.  Note  that  computing  xT  is  trivial,  and  if  c  has  a  negative  component,  then  xT  is  optimal. 
Otherwise,  c  is  nonnegative  and  xc  can  be  found  with  a  max-flow /min-cut  algorithm.  Then  the 
best  solution  among  xT  and  xc  is  optimal.  Now,  consider  the  dominant  of  CUT°(n)  defined  as 
CUT°(n)+  :=  CUT°(n)  +  1%)^2 .  From  (4),  we  have  that  CUT°(n)+  is  an  unbounded  polyhedron 

having  the  same  vertices  as  CUT°(n),  and  moreover,  it  has  an  extended  formulation  of  polynomial  size 
in  n.  Let  L  :=  {x  £  ]R"(n-1)/2|  Xe  —  n2}-  Then  CUT°(n)+  IT  L  is  a  polytope  having  two  classes 
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of  vertices:  those  corresponding  to  vert  (CUT°(n))  and  those  belonging  to  the  hyperplane  defining  L. 
Let  W  be  the  latter  set.  Since  conv(W)  C  conv(T„),  we  obtain  conv(SVi)  =  conv  (CUT°(n)  U  T„)  = 
conv  ((CUT°(n)  U  W)  U  T„))  =  conv  ((CUT°(n)+  fl  L)  U  Tn) .  Applying  disjunctive  programming  in 
the  last  expression  yields  a  compact  extended  formulation  for  conv(.S'n). 

Now,  let  vn  be  any  point  from  Tn,  say  the  one  corresponding  to  {s,  t\  £  E.  We  claim  that  linear  opti¬ 
mization  over  Sn  \  { vrl }  is  A/P-ha rd .  To  prove  this,  consider  an  instance  of  max{c  1  x\  x  £  st-CUT(n)}, 
where  c  is  a  positive  vector.  Let  c  :=  max{ce|  e  £  E}.  Let  d  be  obtained  from  c  as 

j  /  ce  e  7^  {+  t} 

e  \  ce  +  cn2  e  =  {s,t} 


and  consider  the  problem  max{dTx|  x  £  Sn  \  { vn } } .  We  have  that  every  optimal  solution  to  this  problem 
must  satisfy  xst  =  1.  Indeed,  if  a;  £  Tn\{vn},  then  for  some  e  £  En\{{s,  t}}  wehave  dT  x  =  dexe  =  cen2. 
If  x  £  vert  (CUT0  (n))  is  not  an  st-cut,  then  xst  =  0  and  thus  dTx  <  cn2.  On  the  other  hand,  if  x  is  an 
st-cut,  then  xst  =  1  and  thus  d  1  x  >  dstXst  =  cst  +  cn2.  Therefore  xst  =  1  in  any  optimal  solution,  and 
in  particular,  such  a  solution  must  define  an  sf-cut  of  maximum  weight.  Finally,  since  xsf  <  1  defines  a 
face  of  con v(5„  \  {vn})  and  conv(Sn  \  {u„})  PI  {x  £  R"(n_1)/2|  xst  =  1}  =  sf-CUT(n),  we  conclude  that 
xcfconvf.SV,  \  {«„}))  is  exponential  in  n,  for  otherwise  applying  disjunctive  programming  over  all  pairs 
of  nodes  s  and  t  would  yield  an  extended  formulation  for  CUT(n)  of  polynomial  size,  contradicting 
the  results  in  0.  □ 

Contrasting  Proposition  [6]  and  Theorem  |9]  shows  that  the  complexity  of  forbf  P.  X)  depends  on  the 
encoding  of  P.  On  the  other  hand,  in  all  cases  analyzed  so  far.  A'  has  been  explicitly  given  as  a  list. 
Now  we  consider  the  case  where  X  =  vert(F’)  for  some  face  F  of  P. 

Proposition  10.  Given  a  poly  tope  P  C  R"  and  a  face  F,  both  described  in  terms  of  the  linear  inequalities 
defining  them,  optimizing  a  linear  function  over  vert(P)  \  vert(F)  is  AfV-hard.  Moreover,  xc(conv(vert(P)  \ 
vert(F)))  cannot  be  polynomially  bounded  in  the  encoding  length  of  the  inequality  description  of  P  and  thus  not 
in  n. 


Proof.  Let  a  £  Z™  and  b  £  Z+,  and  consider  the  binary  knapsack  set  S  :=  {x  £  {0, 1}”|  aTx  <  b}. 
Let  P  :=  {x  £  [0,  l]ra|  2 aTx  <  26+1}  and  note  that  S  =  P  IT  Zn.  It  is  straightforward  to  verify  that 
x  £  vert(P)  is  fractional  if  and  only  if  2a  1  x  =  26+1.  Then,  if  F  is  the  facet  of  P  defined  by  the  previous 
constraint,  we  have  S  =  vert(P)  \  vert(P).  The  second  part  of  the  statement  is  a  direct  consequence  of 
[17]  using  multipliers  4*  as  discussed  after  Remark  3.4  of  that  reference.  □ 

It  follows  from  Theorem  [9]  and  Proposition  [10]  that  only  when  P  and  X  are  explicitly  given  there  is 
hope  for  efficient  optimization  over  forb(P,  A). 

In  a  similar  vein,  when  the  linear  description  of  P  is  provided,  we  can  consider  the  vertex-enumeration 
problem,  which  consists  of  listing  all  the  vertices  of  P.  We  say  that  such  a  problem  is  solvable  in 
polynomial  time  if  there  exists  an  algorithm  that  returns  the  list  in  time  bounded  by  a  polynomial  of  n, 
f(P),  and  the  output  size  |vert(P)|.  In  |[8j  it  is  shown  that  given  a  partial  list  of  vertices,  the  decision 
problem  "is  there  another  vertex?"  is  A/P-hard  for  (unbounded)  polyhedra,  and  in  (3)  this  result  is 
strengthened  to  polyhedra  having  0-1  vertices  only.  Building  on  these  results,  we  show  hardness  of  the 
forbidden- vertices  problem  (Def.Q])  for  general  polytopes. 

Theorem  11.  The  forbidden-vertices  problem  is  AfV-hard,  even  if  both  P  and  X  are  explicitly  given. 


Proof.  Let  Q  =  {x  £  R"  :  Ax  =  6,  x  >  0}  be  an  unbounded  polyhedron  such  that  vert(Q)  C  {0,1}"- 
In  |3j,  it  is  shown  that  given  the  linear  description  of  Q  and  a  list  X  C  vert(Q),  it  is  A/P-hard  to 
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decide  whether  X  f  vert(Q).  Let  P  be  the  polytope  obtained  by  intersecting  Q  with  the  half-space 
defined  by  Xi  <  n  +  1,  and  let  F  be  the  facet  of  P  associated  with  this  constraint.  Then  we  have 
vert(P)  =  vert(Q)  U  vert (P),  Y^i=i  xi  —  n  f°r  x  £  vert(Q),  and  Y^i=x  Xi  =  n  +  1  for  x  G  vert(F).  Now, 
given  the  description  of  P  and  a  list  X  C  vert(Q)  C  vert(P),  consider  the  instance  of  the  forbidden- 
vertices  problem  min  {^”_1  xt  :  x  G  vert(P)  \  A'}.  The  optimal  value  is  equal  to  n  +  1  if  and  only  if 
A'  =  vert(Q).  Since  the  reduction  is  clearly  polynomial,  the  result  follows.  □ 

In  fact,  it  also  follows  from  (3j  that  the  forbidden-vertices  problem  for  general  polytopes  becomes  hard 
already  for  |A|  =  n.  Fortunately,  the  case  of  0-1  polytopes  is  amenable  to  good  characterizations. 


3  0-1  polytopes 


We  consider  polytopes  having  binary  vertices  only.  We  show  that  forb(P,  A)  is  tractable  as  long  as  P  is 
and  A'  is  explicitly  given.  Our  results  for  P  =  [0,1]"  allow  us  to  obtain  tractability  in  the  case  of  general 
0-1  polytopes. 


3.1  The  0-1  cube 

In  this  subsection  we  have  P  =  [0,  l]n,  and  therefore  vert(P)  =  {0, 1}".  We  show  the  following  result. 
Theorem  12.  Let  X  be  a  list  of  n-dimensional  binary  vectors.  Then  xc(forb([0,  l]n,  A))  <  0(n\X\). 

For  this,  we  present  two  extended  formulations  involving  0{n |A|)  variables  and  constraints.  The 
first  one  is  based  on  an  identification  between  nonnegative  integers  and  binary  vectors.  The  sec¬ 
ond  one  is  built  by  recursion  and  lays  ground  for  a  simple  combinatorial  algorithm  to  optimize  over 
forb([0,  1]  "  ,  X)  and  for  an  extension  to  remove  vertices  from  general  0-1  polytopes. 


3.1.1  First  extended  formulation 

Let  N  :=  {1, . . . ,  n}  and  J\f  :=  {0, . . . ,  2n  —  1}.  There  exists  a  bijection  between  {0, 1}"  and  N  given  by 
the  mapping  a(v)  :=  2 l~1Vi  for  all  v  G  {0, 1}”.  Therefore,  we  can  write  {0,  l}n  =  {v°, . . . ,  v2"-1}, 

where  vk  gives  the  binary  expansion  of  k  for  each  k  e  A f,  that  is,  vk  =  cr”1(/c).  Let  X  =  {vkl , . . . ,  vkm}, 
where  without  loss  of  generality  we  assume  ki  <  ki+ 1  for  all!  =  1, . . . ,  m  —  1.  Also,  let  Afx  :=  {k  G 
Af\  vk  G  A}.  Then  we  have 

{0, 1}"  \  X  =  lx  G  {0, 1}"  |  Y,  2i~1Xi  fMxl- 

l  ieN  } 


Now,  for  integers  a  and  b,  let 

K(a,  b)  =  J  x  G  {0, 1}”|  a<Y  2  i-1*i  <  b  \  . 

I  i£N  ) 

lib  <  a,  then  K  (a,  b)  is  empty.  Set  ko  =  —  1  and  fcm+ 1  =  2n.  Then  we  can  write 

m 

{0, 1}"  \  A  =  (J  K{kt  +  1,  ki+\  —  1). 

1=0 
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forb([0, 1]",  X)  =  conv  K(ki  +  1,  ki+\  —  1)J  =  conv  conv(K(ki  +  1,  kt+1  -  1))J  .  (1) 

For  k  £  Af,  let  Nk  :=  {i  £  N\  vk  =  1}.  From  fl5l  we  have 


, 


thus  conv(A'(a,  b))  has  0{n)  facets.  Finally,  combining  this  and  (lj,  by  Lemma[4j  we  have  that  f orb ([0, 1]",  A') 
can  be  described  by  an  extended  formulation  having  0{n |A|)  variables  and  constraints. 


3.1.2  Second  extended  formulation 

Given  A  C  {0, 1}”,  let  X'  denote  the  projection  of  X  onto  the  first  n  —  1  coordinates.  Also,  let  X  := 

X  \  X,  where  X  is  constructed  from  X  by  flipping  the  last  coordinate  of  each  of  its  elements.  The  result 
below  is  key  in  giving  a  recursive  construction  of  forbQO,  1]",  A). 

Proposition  13.  {0,  l}n  \  X  =  [({0,  l}71"1  \  A')  x  {0, 1}]  U  A. 

Proof.  Given  v  £  {0, 1}",  let  v'  £  {0,  l}”-1  and  v  £  {0, 1}”  be  the  vectors  obtained  from  v  by  removing 
and  by  flipping  its  last  coordinate,  respectively. 

Let  v  £  {0, 1}”  \  A.  If  v  £  A,  since  v  f  X ,  we  have  v  £  X.  Otherwise  v'  <f  A',  and  thus  v  £ 

({oor-^^oxioo}. 

For  the  converse,  note  that  A  C  {0, 1}”  \  A.  Finally,  if  v  £  ({0,  l}n_1  \  A')  x  {0, 1},  then  v'  ^  A'  and 
thus  v  f  X.  □ 

The  second  proof  of  Theorem[l2lfollows  from  Proposition[l3]by  induction.  Suppose  that  forb(  [0,  l]n_1 ,  A') 
has  an  extended  formulation  with  at  most  (n  —  1)(|A'|  +  4)  inequalities,  which  holds  for  n  =  2.  Then 
we  can  describe  forb([0, 1]"_1,  A')  x  {0, 1}  using  at  most  (n  —  1)(|A'|  +  4)  +  2  inequalities.  Since  the 
polytope  conv(A’)  requires  at  most  |A|  inequalities  in  an  extended  formulation,  we  obtain  an  extended 
formulation  for  forb([0,  l]n,  A)  of  size  no  more  than  [(n  —  1)(|A'|  +4)  +  2  +  1]  +  [|A|  +  1]  <  n(|A|  +  4). 


3.2  General  0-1  polytopes 

In  this  subsection  we  analyze  the  general  0-1  case.  We  show  that  the  encoding  of  A  plays  an  important 
role  in  the  complexity  of  the  problem. 
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3.2.1  Explicit  X 


In  order  to  prove  tractability  of  the  forbidden  vertices  problem  corresponding  to  general  0-1  tractable 
polytopes,  we  introduce  the  notion  of  A- -separating  faces  for  the  0-1  cube. 

Definition  14.  Given  X  C  {0, 1}",  wesaythat  T  C  facesQO,  l]ra)  is  X -separating if  {0,l}n\X  =  U f^FCi 
{0,  l}n.  We  denote  by  p{X)  the  minimal  cardinality  of  an  X-separating  set. 


Clearly,  if  T  is  X-separating,  then 

min  {cTx|  x  £  {0, 1}"  \  X}  =  min  min  {cT x\  x  £  F  fl  {0, 1}"}  . 


Thus,  if  we  can  find  an  X-separating  family  of  cardinality  bounded  by  a  polynomial  on  n  and  |X|,  then 
we  can  optimize  in  polynomial  time  over  {0,  l}n  \  X  by  solving  the  inner  minimization  problem  for 
each  F  £  T  and  then  picking  the  smallest  value. 

Proposition  15.  For  every  nonempty  set  X  C  {0, 1}",  we  have  p(X)  <  n|X|. 


Proof.  For  each  y  £  {0,  l}n  \  X,  let  0  <  k  <  n  —  1  be  the  size  of  the  longest  common  prefix  between 
y  and  any  element  of  X,  and  consider  the  face  F  =  F(y )  :=  {x  £  [0,  l]n|  Xi  =  yi  VI  <  i  <  k  +  1}  = 
(j/i, . . . ,  yk,yk+ 1)  x  [0,  l]ra_fc_1.  Then  the  collection  T  :=  {F(y)\  y  £  {0, 1}"  \  X}  is  X-separating  since 
any  y  £  {0, 1}™  \  X  belongs  to  F(y)  and  no  element  of  X  lies  in  any  F(y)  by  maximality  of  k.  Clearly, 
\JF\  <  n\X\  since  each  face  in  T  is  of  the  form  (vi, . . . ,  v k,  1  —  ^fc+i)  x  [0,  for  some  v  £  X.  □ 

In  other  words,  letting  X*  be  the  projection  of  X  onto  the  first  i  components  and  X'  :=  (XI_1  x  {0, 1})  \ 
X1 ,  where  X1  :=  {0, 1}  \  X1,  we  have 

n 

{0, \}n  \  X  =  U  [x,:  x  {0,  l}""4  . 

i= 1 


Moreover,  it  also  follows  from  the  proof  of  Proposition[l5]that  //(X)  is  at  most  the  number  of  neighbors 
of  X  since  if  (v\, . . . ,  i’k,  1  —  ^+1,^+2)  ■  •  • ,  v„)  is  a  neighbor  of  v  £  X  that  also  lies  in  X,  then  the  face 
{(vi, . . .  ,Vk,  1  —  ^fc+i)}  x  [0,  in  not  included  in  T  in  the  construction  above. 

Now,  let  P  C  R"  be  an  arbitrary  0-1  polytope.  Note  that  vert(P)  \  X  =  vert(P)  fl  ({0, 1}”  \  X).  On  the 
other  hand,  if  T  C  faces([0, 1]")  is  X'-separating,  then  {0, 1}"  \X  =  U fg^F  fl  {0, 1}”.  Combining  these 
two  expressions,  we  get 

vert(P)  \  X  =  (J  vert(P)  (T  P  D  {0,  l}n  =  (J  P  D  P  D  {0, 1}". 

F Gf  FGF 


Note  that  since  P  has  0-1  vertices  and  P  is  a  face  of  the  unit  cube,  then  POP  is  a  0-1  polytope.  Moreover, 
if  P  is  tractable,  so  is  P  fl  F.  Recalling  that  p(X)  <  n \ X  from  Proposition  11 51  we  obtain 

Theorem  16.  If  P  C  R”  is  a  tractable  0-1  polytope,  then  the  forbidden-vertices  problem  is  polynomially  solvable. 

In  fact,  a  compact  extended  formulation  for  vert(P)  \  X  is  available  when  P  has  one. 

Proposition  17.  For  every  0-1  polytope  P  and  for  every  nonempty  set  X  C  vert(P),  we  have 

xc(forb(P,  X))  <  p(X)(xc(P)  +  1). 
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Proof.  The  result  follows  from 


forb(P,  X)  =  conv  (J  POFfljO,  l}n 
\f&e 

Lemma|4j  and  xc(f’)  <  xc(P)  for  any  face  F  of  P.  □ 

Observe  that  when  P  is  tractable  but  its  facet  description  is  not  provided.  Theorem Il6l is  in  contrast  to 
Theorem[9]  Having  all  vertices  with  at  most  two  possible  values  for  each  component  is  crucial  to  retain 
tractability  when  X  is  given  as  a  list.  However,  when  X  is  given  by  a  face  of  P,  the  forbidden-vertices 
problem  can  become  intractable  even  in  the  0-1  case. 


3.2.2  Implicit  X 

Let  TSP(n  )  denote  the  convex  hull  of  the  characteristic  vectors  of  Hamiltonian  cycles  in  the  complete 
graph  Kn .  Also,  let  SUB(n)  denote  the  subtour-elimination  polytope  for  Kn  with  edge  set  En. 

Theorem  18.  For  each  n,  there  exists  a  0-1  polytope  Pn  C  fljH"-1)/2  and  a  facet  Fn  G  facets(Pn)  such  that 
linear  optimization  over  Pn  can  be  done  in  polynomial  time  and  xc (Pn)  is  polynomially  bounded,  but  linear 
optimization  over  vert (P„)  \  vert(Pn)  is  AfV-hard  and  xc(forb(P„,  vert(Fn)))  grows  exponentially. 

Proof.  Given  a  positive  integer  n,  consider  T+  :=  {a;  G  {0,  l}En\  xe  =  n  +  1},  Tn  :=  {x  G 

{0,  l}En\  J2e£En  xe  =  n  —  1},  and  F[n  :=  TSP(n)  fl  {0,  l}En.  The  idea  is  to  "sandwich"  Hn  between  T~ 
and  Tf  to  obtain  tractability,  and  then  remove  T~  to  obtain  hardness. 

We  first  show  that  linear  optimization  over  T~  L)FtnUTf  is  polynomially  solvable.  Given  c  G  ]Rra(n_1h2, 
consider  max{c  1  x\  x  G  T~  U  H„  U  Tf }.  Let  x  and  x+  be  the  best  solution  in  T~  and  Tf,  respectively, 
and  note  that  x~  and  x+  are  trivial  to  find.  Let  m  be  the  number  of  nonnegative  components  of  c.  If 
m  >  n  +  1,  then  x+  is  optimal.  If  to  <  n  —  1,  then  x~  is  optimal.  If  m  =  n,  let  xn  G  {0,  l}En  have 
a  1  at  position  e  if  and  only  if  ce  >  0.  If  xn  belongs  to  //,,,  which  is  easy  to  verify,  then  it  is  optimal. 
Otherwise  either  x~  or  x+  is  an  optimal  solution. 

Now  we  show  that  linear  optimization  over  Hn  U  Tf  is  A'P-hard.  Given  c  G  Rn(n~h/2  with  c  >  0, 
consider  min{cT.T|  x  G  Hn}.  Let  c  :=  max{ce|  e  G  En}  and  define  de  :=  ce  +  nc.  Consider  min{dTa;|  x  G 
Hn  U  T+ }.  For  any  x  G  T+,  we  have  dTx  =  (n  +  1  )nc  +  cTx  >  (n  +  1  )nc.  For  any  x  G  Hn/  we  have 
d 1  x  =  n2c  +  c  x  <  n2c  +  nc=  (n  +  1  )nc.  Hence,  the  optimal  solution  to  the  latter  problem  belongs  to 
Hn  and  defines  a  tour  of  minimal  length  with  respect  to  c. 

Letting  Pn  :=  con v(T~  U  Hn  U  Tf ),  we  have  that  Pn  is  a  tractable  0-1  polytope,  JfeeE  Xf'  —  71  ~  * 
defines  a  facet  Fn  of  P„,  and  vertf Pn)  \  vert(Pn)  =  //,,  U  'If ,  which  is  an  intractable  set.  Now,  since 
forb(Pn,  vert(Pn))  =  conv(7T„  U  Tf ),  we  have  that  f2e€Eri  xe  >  n  defines  a  facet  of  forb(Pn,  vert(P„)) 
and  forb(P„,  vert(P„))  n  {x  G  Rra(n_1h2|  xe  =  n}  =  TSP(n).  Therefore,  xc(forb(Pn,  vert(P„))) 

is  exponential  in  n  fl8|.  It  remains  to  show  that  xc(P„)  is  polynomial  in  n. 

Let  Tn  :=  {a;  G  {0,1}^  |  E eeEnxe  =  n}  and  let  FIn  :=  Tn\  Fln  be  the  set  of  incidence  vectors  of 
n-subsets  of  En  that  do  not  define  a  Hamiltonian  cycle.  Given  x  G  {0,  1 } K" ,  let  N(x)  be  the  set  of  neigh¬ 
bors  of  a;  in  [0,  l]En,  let  L(x)  be  the  half-space  spanned  by  N(x)  that  does  not  contain  x,  and  let  C(x)  := 
[0,  l]En  \  L(x).  Finally,  let  A„  :=  conv(T“  UT„UT+ )=  {x  G  [0,  \}En  \  n  -  1  <  Ee6sn  xe<n+l}. 

We  claim  that  Pn  =  con v(T~  U  SUB(n)  U  T+ ).  By  definition,  we  have  Pn  C  conv(T“  U  SUB(n)  U  T~ ). 
To  show  the  reverse  inclusion,  it  suffices  to  show  SUB(n)  C  Pn.  Note  that  any  two  distinct  elements  in 
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Tn  can  have  at  most  \En\  —  2  tight  inequalities  in  common  from  those  defining  A„ .  Thus,  Tn  defines 
an  independent  set  in  the  graph  of  An.  Moreover,  for  each  x  £  Tn  the  set  of  neighbors  in  An  is  N{ x) 
and  thus  all  vertices  in  Tn  are  simple.  As  Hn  C  Tn,  we  have  that  //  ,,  is  simple  and  independent,  and 
by  Corollary[8]we  have 

Pn  =  A„  n  P)  L(x)  =  An\  (J  C(x). 

x£Hn  x£Hn 


Since  SUBfn)  C  Ani  from  the  second  equation  above,  it  suffices  to  show  C(x)  n  SUB(n)  =  0  for  all 
x  £  II „ ■  For  this,  note  that  for  any  x  £  Hn,  there  exists  a  set  0  f  S  C  14  such  that  x(6(S))  <  1,  which 
implies  y(5(S))  <  2  for  all  y  £  N(x).  Thus  C[x)  fl  SUB(n)  =  0  as  x(S(S))  >  2  is  valid  for  SUB(n). 

Finally,  applying  disjunctive  programming  and  since  xc(SUB(n))  is  polynomial  in  n  l20ll.  we  conclude 
that  Pn  has  an  extended  formulation  of  polynomial  size.  □ 


To  conclude  this  section,  consider  the  case  where  P  is  explicitly  given  and  X  is  given  as  a  facet  of  P. 
Although  we  are  unable  to  establish  the  complexity  of  the  forbidden-vertices  problem  in  this  setting, 
we  present  a  tractable  case  and  discuss  an  extension. 

Proposition  19.  Let  P  =  {a;  £  M"|  Ax  <  b}  be  a  0-1  poly  tope,  zvhere  A  is  TU  and  b  is  integral.  Let  F  be  the 
face  of  P  defined  by  aj  x  =  bi,  Then 

forb(P,  vert(F))  =  P  fl  {x  £  R"|  aj x  <  bi  —  1}. 


Proof.  We  have 

vert(P)  \  vert(P)  =  P  fl  {x  £  {0,  l}ra|  aj x  <  bi  —  1}. 

Since  A  is  TU  and  b  in  integral,  the  set  P  fl  {x  £  R"|  ufx  <  h,  —  1 }  is  an  integral  polyhedron  contained 
in  P,  which  is  a  0-1  polytope.  □ 


Since  any  face  is  the  intersection  of  a  subset  of  facets,  the  above  result  implies  that  removing  a  single 
face  can  be  efficiently  done  by  disjunctive  programming  in  the  context  of  Proposition  [19]  Also,  if  we 
want  to  remove  a  list  of  facets,  that  is,  X  =  UFe0vert(P)  and  Pisa  subset  of  the  facets  of  P,  then  we 
can  solve  the  problem  by  removing  one  facet  at  a  time.  However,  if  P  is  a  list  of  faces,  then  the  problem 
becomes  hard  in  general. 

Proposition  20.  If  P  is  a  list  of  faces  of[  0, 1]",  then  optimizing  a  linear  function  over  {0, 1}"  \  Ufg^^KF) 
is  AfV-hard. 


Proof.  Let  G  =  (V,  E )  be  a  graph.  Consider  the  problem  of  finding  a  minimum  cardinality  vertex  cover 
of  G,  which  can  be  formulated  as 


min  Eiev  x% 
s.t.  Xi  +  Xj  >  1  V{i,  j}  £  E 
Si  €{0,1}  Vi  €  V. 

Construct  P  by  adding  a  face  of  the  form  F  =  {x  €  [0,  l]n|  a =  0,  xj  =  0}  for  each  {i,j}  G  E. 
Then  the  vertex  cover  problem,  which  is  A/P-hard,  reduces  to  optimization  of  a  linear  function  over 
(0, 1}"  \  UFe^vert(P).  □ 
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4  Applications 


4.1  k- best  solutions 


The  k- best  problem  defined  below  is  closely  related  to  removing  vertices. 

Definition  21.  Given  a  nonempty  0-1  polytope  P  C  Rra,  a  vector  c  £  1",  and  a  positive  integer  k,  the  k-best 
problem  is  to  either  assert  |vert(P)|  <  k  and  return  vert(P),  or  to  return  v\ ,Vk  £  vert(P),  all  distinct, 
such  that  max{cTr)j|  i  =  1, . . . ,  k}  <  min{cTu|  v  £  vert(P)  \  {ui, . . . ,  r'fc}}. 


Since  we  can  sequentially  remove  vertices  from  0-1  polytopes,  we  can  prove  the  following. 

Proposition  22.  Let  P  C  [0,  l]n  be  a  tractable  0-1  polytope.  Then,  for  any  c  £  Rn,  the  k-best  problem  can  be 
solved  in  polynomial  time  on  k  and  n. 


Proof.  For  each  i  =  1, . . . ,  k,  solve  the  problem 

(Pi)  min  cTx 
s.t.  x  £  Pi, 


where  Pi  :=  P,  Pi  :=  forb(Pi_i,  {r^-i})  =  forb(P,  {vi, . . . ,  i>j_i})  for  i  =  2, . . . ,  k,  and  Vi  £  vert(Pi)  is 
an  optimal  solution  to  (Pi),  if  one  exists,  for  i  =  1 , ...  ,k.  From  Theorem Qbl  we  can  solve  each  of  these 
problems  in  polynomial  time.  In  particular,  if  (Pi)  is  infeasible,  we  return  v%, ... ,  vt-  \ .  Otherwise,  by 
construction,  v\ , . . . ,  Vk  satisfy  the  required  properties.  Clearly,  the  construction  is  done  in  polynomial 
time.  □ 

The  above  complexity  result  was  originally  obtained  in  fTOll  building  on  ideas  from  (16]]  by  applying  a 
branch-and-fix  scheme. 


4.2  Binary  all-different  polytopes 

With  edge-coloring  of  graphs  in  mind,  the  binary  all-different  polytope  has  been  introduced  in  fll|.  It 
was  furthermore  studied  in  [14]  and  [13].  We  consider  a  more  general  setting. 

Definition  23.  Given  a  positive  integer  k,  nonempty  0-1  polytopes  I\ .....  Pk  in  1",  and  vectors  a,  ...,Ck  £ 
R”,  the  binary  all-different  problem  is  to  solve 

(P)  min  Et=i  cJxi 

s.t.  Xi  £  vert(Pi)  i  =  l,...,k 

Xi  ^  Xj  1  <  i  <  j  <  k. 


In  lilli.  it  was  asked  whether  the  above  problem  is  polynomially  solvable  in  the  case  Pi  =  [0,  1 1 71  for  all 
i  =  1, . . . .  k.  Using  the  tractability  of  the  k- best  problem,  we  give  a  positive  answer  even  for  the  general 
case  of  distinct  polytopes. 

Given  a  graph  G  =  (V,  E)  and  U  C  V,  a  (/-matching  in  G  is  a  matching  MCE  such  that  each  vertex 
in  U  is  contained  in  some  element  of  M. 
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Theorem  24.  If  Pi  C  R"  is  a  tractable  nonempty  0-1  poly  tope  for  i  =  then  the  binary  all-different 

problem  is  polynomially  solvable. 


Proof.  For  each  i  =  1, . . . .  k,  let  .S',  be  the  solution  set  of  the  A: -best  problem  (Def.  [2TJ  for  I\  and  c, . 
Observe  that  1^ |  <  k.  Now,  consider  the  bipartite  graph  G  =  (S  U  R,E),  where  S  :=  IJ-=1  S',  and 
R  :=  {1, . . . ,  k } .  For  each  v  G  S  and  i  €  R,  we  include  the  arc  {t\  *}  in  E  if  and  only  i f  v  G  S, .  Finally, 
for  each  {u,  i}  G  E,  we  set  wVi  :=  cj v. 

We  claim  that  (V)  reduces  to  finding  an  //-matching  in  G  of  minimum  weight  with  respect  to  w.  It 
is  straightforward  to  verify  that  an  //-matching  in  G  defines  a  feasible  solution  to  (V)  of  equal  value. 
Thus,  it  is  enough  to  show  that  if  (fP)  is  feasible,  then  there  exists  an  /i-matching  with  the  same  optimal 
value.  Indeed,  let  {x-\ , . . . ,  Xk)  be  an  optimal  solution  to  CP)  that  does  not  define  an  /i-matching,  that 
is,  such  that  Xi  f  Si  for  some  i  =  1, . . .  ,k.  Then,  we  must  have  |vert(Pi)|  >  k  and  |5i|  =  k.  This  latter 
condition  and  Xi  f  Si  imply  the  existence  of  v  G  .S’,  such  that  v  f  x:j  for  all  j  =  1, . . . ,  k.  Furthermore, 
by  the  definition  of  Si,  we  also  have  cj v  <  cj x,.  Therefore,  the  vector  (x\, . . . ,  x,_i .  v.  x,+i , . . . ,  xf)  is 
an  optimal  solution  to  CP)  having  its  i-th  subvector  in  S, .  Iteratively  applying  the  above  reasoning  to 
all  components,  we  obtain  an  optimal  solution  to  (V)  given  by  an  //-matching  as  desired.  □ 


5  Extension  to  integral  polytopes 


In  this  section,  we  generalize  the  forbidden-vertices  problem  to  integral  polytopes,  that  is,  to  polytopes 
having  integral  extreme  points,  even  allowing  the  removal  of  points  that  are  not  vertices.  We  show  that 
for  an  important  class  of  integral  polytopes  the  resulting  problem  is  tractable. 

For  an  integral  polytope  P  C  R"  and  ICP  n  IP,  we  define  forb/(P,  X)  :=  conv((P  D  IP)  \  X). 

Definition  25.  Given  an  integral  poly  tope  P  C  R”,  a  set  X  CPnZ"  of  integral  vectors,  and  a  vector  c  G  R™, 
the  forbidden-vectors  problem  asks  to  either  assert  (P  n  IP)  \X  =  0,  or  to  return  a  minimizer  of  cTx  over 
(P  n  IP)  \  X  otherwise. 

Given  vectors  l,u  £  R™  with  l  <  u,  we  denote  [/,  u]  :=  {x  G  R”|  f  <  x,  <  Ui,i  =  1, . . . ,  n}.  We  term 
these  sets  as  boxes. 

Definition  26.  An  integral  polytope  P  C  R™  is  box-integral  if  for  any  pair  of  vectors  l,u  £lP  with  l  <  u,  the 
polytope  P  D  [/,  u ]  is  integral. 

Polytopes  defined  by  a  TU  matrix  and  an  integral  right-hand-side,  or  by  a  box-TDI  system,  are  exam¬ 
ples  of  box-integral  polytopes.  Further  note  that  if  P  is  tractable  and  box-integral,  so  is  P  fl  [/,  it] .  When 
both  conditions  are  met,  we  say  that  P  is  box-tractable. 

With  arguments  analogous  to  that  of  the  0-1  case,  we  can  verify  the  following  result. 

Theorem  27.  If  P  C.  R"  is  a  box-tractable  polytope,  then,  given  a  list  X  C  P  n  IP,  the  forbidden-vectors 
problem  is  polynomially  solvable.  Moreover, 

xc(forb/(P, X))  <  2n|X|(xc(P)  +  1). 


Proof.  Since  P  is  bounded,  it  is  contained  in  a  box.  Without  lost  of  generality  and  to  simplify  the 
exposition,  we  may  assume  that  PC  [0,  r  —  l]n  for  some  r  >  2.  As  in  the  0-1  case,  we  first  address 
the  case  P  =  [0,r  —  l]n,  for  which  we  provide  two  extended  formulations  for  forb/(P,  X)  involving 
0(n|X|)  variables  and  constraints. 
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The  first  extended  formulation  relies  on  the  mapping  <j>(x)  :=  Y^i=i  t,_1  x>  f°r  x  £  [0.  r  1]",  which 
defines  a  bisection  with  {0, . . . ,  rn  —  1}.  Letting  Kr(a ,  b)  :=  {x  £  {0, . . . ,  r  —  1}"|  a  <  cj)(x)  <  b},  we 
have  that  forb /  ( P.  A)  is  the  convex  hull  of  the  union  of  at  most  \X\  +  1  sets  of  the  form  Kr(a,  b).  Since 
conv(/vr(a.  b))  has  O(n)  facets  0/  by  disjunctive  programming  we  obtain  an  extended  formulation  for 
forb /(P,  X)  having  0(n\X\)  inequalities. 

For  the  second  extended  formulation,  let  X'  denote  the  projection  of  X  onto  the  first  n  —  1  coordinates 
and  set  X  :=  (X1  x  {0, . . . ,  r  —  1})  \  X.  Along  the  lines  of  Proposition^  we  have 

{0, . . . , r  —  1}"  \X  =  [({0, . . . , r  -  I}""1  \ X')  x  {0, . . . , r  —  1}]  U  A'. 

Although  X  can  have  up  to  r|A|  elements,  we  also  see  that  X  is  the  union  of  at  most  2 1 A’  sets  of  the 
form  v  x  {a, . . . ,  /3}  for  v  £  X'  and  integers  0  <  a  <  3  <  r  -  1 .  More  precisely,  for  each  v  £  A',  there 
exist  integers  0  <  a\  <  <  /3%  <  •  •  •  <  otvqv  <  f3qv  <  r  —  1  such  that 

Qv 

x=  U  Uw XK, ■••,$’} 

vex'  i=i 


and  ~}Zv(zx'  Qv  —  2|  A|.  Therefore,  conv(A)  can  be  described  with  0(|A|)  inequalities.  Then  a  recursive 
construction  of  an  extended  formulation  for  forb j(P,  X)  is  analogous  to  the  binary  case  and  involves 
0(n |A|)  variables  and  constraints. 

In  order  to  address  the  general  case,  we  first  show  how  to  cover  {0, . . . ,  r  —  1  }n  \  A  with  boxes.  For  each 
i  =  let  X1  be  the  projection  of  A  onto  the  first  i  components  and  let  A*  :=  (A’'~  1  x  {0, . . . ,  r  — 

1})  \  A1,  where  A1  :={0,...,r  —  ljyA1.  Working  the  recursion  backwards  yields 

n 

{0,...,r-l}"\A=U  [^ix{0)...,r-l}n"i'  . 

2=1 

Combining  the  last  two  expressions,  we  arrive  at 

n  qv 

{0, . . . ,  r  —  1}"  \  A  =  (J  U 

i=i  vex*-1  z=i 

The  right-hand-side  defines  a  family  B  of  at  most  2n|A|  boxes  in  K",  yielding 

{0, . . .  ,r  —  1}"  \  A  =  |J  [l,u]r  Zn. 


Finally,  if  P  C  [0,  r  —  l]n,  then 

(Pnz“)\A  =  (Pnz“)n({o,..,r-i}"\A)=  (J  Pn[i,u]nz" 

[z,«]ee 

Moreover,  if  P  is  box-tractable,  then 

forb/(P,  A)  =  conv  I  conv  ( P  fl  [l,  u]  IT  Zn)  J  =  conv  I  [J  P  IT  [l,  u]  J  , 

\[I,u]6B  /  \[Z,u]eB 


where  each  term  within  the  union  is  a  tractable  set. 


□ 
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The  fc-best  problem  and  the  binary  all-different  problem  can  be  extended  to  the  case  of  integral  vectors 
as  follows. 

Definition  28.  Given  a  nonempty  integral  polytope  P  C  Rn,  a  vector  c  G  R”,  and  a  positive  integer  k,  the 
integral  k-best  problem  is  to  either  assert  \P  n  Z"|  <  k  and  return  P  n  IP ,  or  to  return  v\  ,...,Vk  G  P  l~l  IP, 
all  distinct,  such  that  max{cTn;|  i  =  1, . . . ,  k}  <  min{cTi>|  v  G  (P  D  IP)  \  {v\, . . . ,  r’fe}}. 

Definition  29.  Given  a  positive  integer  k,  nonempty  integral  polytopes  Pi .... .  Pk  in  Rn,  and  vectors  ci,...,Ck  G 
R",  the  integral  all-different  problem  is  to  solve 

(P)  min  Jfi=i  CJ xi 

s.t.  ijGPnZ"  i  =  l,...,k 

Xi  Xj  \  <  i  <  j  <  k. 


The  above  problems  can  be  shown  to  be  polynomially  solvable  if  the  underlying  polytopes  are  box- 
tractable. 
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Abstract 

We  consider  the  integer  L-shaped  method  for  two-stage  stochastic  integer  programs.  To  improve 
the  performance  of  the  algorithm,  we  present  and  combine  two  strategies  that  deal  with  different 
aspects  of  the  algorithm.  First,  to  avoid  time-consuming  exact  evaluations  of  the  second-stage  cost 
function,  we  propose  a  simple  modification  that  alternates  between  linear  and  mixed-integer  sub¬ 
problems.  Then,  to  better  approximate  the  shape  of  the  second-stage  cost  function,  we  present  a 
general  framework  to  generate  optimality  cuts  via  a  cut-generating  linear  program  which  considers 
information  from  all  solutions  found  up  to  any  given  stage  of  the  method.  In  order  to  address  the  im¬ 
pact  of  the  proposed  approaches,  we  report  computational  results  on  two  classes  of  stochastic  integer 
problems. 


1  Introduction 


In  this  work  we  consider  mixed-integer  programs  of  the  form 

(IP)  min  cx  +  dz  +  9 

x,z,6 

s.t.  Ax  +  Cz  <  b 

Q(x)-e<  o 
xe  {o,i}” 
z  >  o,  z  e  z. 


(1) 

(2) 

(3) 

(4) 


where  Z  is  a  mixed-integer  set  and  Q(x)  is  a  real-valued  function  taking  a  binary  vector  x  as  argument. 
We  say  that  (x*,z*,  6* )  is  a  candidate  solution  if  (x*,z* )  satisfies  Q,  pi,  and  j4).  If  in  addition  Q  holds, 
then  we  say  (x* ,z* ,  6*)  is  a  feasible  (candidate)  solution.  Constraint®  together  with  the  presence  of  9 
in  the  objective  function  ensures  8  =  Q(x)  is  satisfied  by  any  optimal  solution  to  (IP).  A  fundamental 
assumption  is  that  given  x,  Q(x)  can  be  computed  with  a  reasonable  amount  of  effort. 


In  the  context  of  two-stage  stochastic  integer  programming,  we  usually  have 


Q(x)  := 


min  {qy  :  Wy  =  h  —  Tx,  y  £  Y} 


which  denotes  the  expected  second-stage  cost  of  x  with  respect  to  the  random  data  £  =  ( q,W,T,h ). 
We  assume  that  Y  imposes  some  integrality  requirements  on  y.  When  £  has  a  finite  set  of  possible 
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outcomes,  we  have  Q(x)  =  PcQ^(x),  where  Qg(x)  denotes  the  optimal  second-stage  value  of  the 
scenario  associated  to  £,  and  p?  is  the  probability  of  occurrence  of  £.  Thus,  (IP)  can  be  cast  as  a  large- 
scale  mixed  integer  program.  When  the  burden  of  solving  (IP)  is  mainly  due  to  the  presence  of  a  large 
number  of  scenarios,  schemes  similar  to  Benders'  decomposition  |4j  and  the  L-shaped  method  ll6l  can 
be  effective.  The  idea  is  to  relax  (J2J1  and  consider  6  as  an  underestimator  of  Q(x),  and  successively  add 
cuts  in  the  (x,  (?) -space  to  better  approximate  the  shape  of  Q(x).  This  is  done  until  an  optimal  solution 
( x*,z*,6 *)  satisfying  6*  —  Q(x*)  is  found.  When  the  second-stage  problem  is  a  linear  program,  Q(x)  is 
convex  in  x  and  thus  can  be  approximated  by  subgradients  using  optimal  dual  solutions.  In  contrast, 
when  the  second-stage  problem  is  a  mixed-integer  program,  such  a  nice  property  does  not  hold,  and 
moreover,  Q(x)  can  even  be  discontinuous.  Thus,  the  decomposition  approaches  of  the  linear  case 
have  to  be  modified  to  accommodate  integer  variables  in  the  second  stage.  In  j8j,  such  a  modification, 
the  integer  L-shaped  method,  is  introduced.  It  is  designed  for  two-stage  stochastic  integer  problems 
having  binary  first-stage  variables  as  it  exploits  the  facial  property  of  0-1  sets.  More  generally,  the 
integer  L-shaped  method  can  be  applied  to  any  mixed-integer  problem  having  the  form  of  (IP)  as  long 
as  Q(x)  is  computable  from  binary  x.  In  particular,  it  also  fits  situations  where  Q(x)  can  be  evaluated 
with  a  closed-form  analytical  formula,  but  it  does  not  have  an  amenable  mixed-integer  formulation. 
Applications  of  this  method  include  vehicle  routing  JT1|,  j6j,  probabilistic  traveling  salesman  problems 
f9l,  location  problems|10J,  and  generalized  assignment  |T|,  among  others. 

Next  we  describe  the  integer  L-shaped  method.  Let  X  be  the  projection  of  the  feasible  region  of  (IP) 
onto  the  x-space,  and  let  L  G  Rbea  lower  bound  on  Q(x)  over  X.  Then  (IP)  can  be  equivalently 
formulated  as 


(MP)  min  cx  +  dz  +  9 
s.t.  Ax  +  Cz  <  b 

Tlx  —  10  <  tzq  (5) 

xg  {0,1}” 

z  >  0,  z  e  z 
9  >  L, 

where  1  denotes  a  vector  of  ones  of  appropriate  size,  as  long  as  for  each  x*  £  X  constraints  ([5j  include 
a  cut  of  the  form  nkx  —  6  <  tzq  such  that  nkx  —  tIq  <  Q(x)  for  all  x  G  X  and  nkx*  —  7Tq  =  Q(x*).  In 
other  words,  the  affine  function  rzkx  —  7Tq  underestimates  Q(x)  on  X,  and  the  estimate  is  tight  at  x*. 
The  optimality  cuts  of  Laporte  and  Louveaux  (51  define  such  a  cut  family  and  form  the  basis  of  the 
integer  L-shaped  method. 

Given  x*  G  {0, 1 }”,  let  S(x*)  :=  { i  :  x*  —  1}.  In  the  (standard)  integer  optimality  cut  at  x*  is 
defined  as 

0>(Q(x*)  — L)(  £  *«-  £  xf-|S(x*)|  )  +Q(x*).  (6) 

\ieS(x*)  HS(x*)  ) 

LetAY«(x)  :=  |S(x*)|  —  D;gs(x*)  xi  +  ^iis(x*)  xi  be  the  Hamming  distance  between  x  and  x* ,  and  note 
that  0  <  Av*  (x)  <  n  with  AY*  (x)  —  0  if  and  only  if  x  =  x*.  Thus,  if  x  =  x* ,  then  the  right-hand  side 
of  l[6j  attains  its  maximum  value  Q(x*).  If  x  G  {0, 1 }"  \  {x*},  then  it  takes  a  value  less  than  or  equal 
to  L.  Since  9  >  L,  combining  both  cases,  we  have  that  ([6l  models  the  implication  x  =  x*  =>  e>Q(x). 
Observe  also  that  as  we  have  one  cut  per  element  in  X,  (p|  might  have  exponentially  many  constraints. 
Thus,  ([5}  is  omitted  from  the  initial  formulation  (MP)  and  cuts  (J6jl  are  added  on-the-fly  as  new  solutions 
are  discovered. 

It  is  important  to  keep  in  mind  that  given  the  enumerative  nature  of  <J6]),  in  practice  these  cuts  are 
complemented  with  other  inequalities  that,  albeit  not  tight,  help  to  improve  the  global  lower  bound 
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on  Q(x).  When  Q(x)  is  the  expected  second-stage  value  of  x  given  by  the  value  function  of  a  mixed- 
integer  program,  the  most  obvious  inequalities  to  add  are  the  subgradient  cuts  given  by  the  continuous 
relaxation  Qlp(x)  of  Q(x).  They  have  the  form 

9  >  s(x  —  x*)  +  Qlp(x*),  (7) 

where  s  is  a  subgradient  of  Qpp(x)  at  x*. 

An  implementation  of  the  integer  L-shaped  method  with  a  current  state-of-the-art  solver  works  as 
follows.  Having  computed  a  lower  bound  L  on  Q(x)  and  solved  the  continuous  relaxation  of  (IP)  with 
Benders'  decomposition,  we  end  up  with  a  linear  master  problem  that  includes  subgradient  cuts  of  the 
form  10.  Then  we  reinforce  the  binary  requirements  on  x  and  any  integrality  restrictions  on  z,  leading 
to  a  mixed-integer  master  problem  of  the  form  (MP),  but  where  the  system  0  is  a  relaxation  of  (IP),  so 
that  an  optimal  solution  to  the  current  problem  may  not  be  feasible  to  (IP).  The  idea  now  is  to  solve  the 
mixed-integer  master  problem  in  a  way  such  that  all  integer  solutions  are  checked  against  feasibility 
with  respect  to  (IP)  before  being  accepted  as  an  incumbent.  For  this,  the  solver  proceeds  in  a  similar 
fashion  to  branch-and-cut,  that  is,  it  generates  a  search  tree  by  solving  linear  subproblems,  branching, 
and  adding  cutting  planes.  The  main  difference  is  that  when  a  candidate  integer  solution  (x*,z*,8*) 
satisfying  §,0  and  0  is  found  at  a  node  of  the  search  tree,  an  additional  routine,  the  so-called 
optimality  cut  function,  is  called  in  order  to  assert  feasibility  and  add  optimality  cuts.  If  the  solution 
is  infeasible  to  the  true  problem  (IP),  i.e.,  6*  <  Q(x* ),  this  function  generates  an  optimality  cut  that  is 
applied  to  all  pending  nodes  in  the  master  problem  tree,  ensuring  that  this  solution  is  discarded.  Then 
the  solver  continues  exploring  the  tree  with  the  guarantee  that  any  discarded,  and  thus  infeasible, 
solution  will  not  appear  again.  If  the  solution  is  actually  feasible  to  (IP),  then  it  is  accepted  by  the 
optimality  cut  function  and  the  current  incumbent  is  updated  accordingly.  A  modern  implementation 
of  the  (standard)  integer  L-shaped  method  is  presented  in  Algorithm  [Tjbelow. 

Algorithm  1  Integer  L-shaped  method 

Input:  A,  C,  fr,  c,  d,  Q  :  X  ^  R,  QLP  :  X  ->•  R 
Output:  Optimal  solution  x*  to  (IP)  and  optimal  value 
l:  Compute  a  lower  bound  L  of  Q(x) 

2:  Solve  the  LP  relaxation  of  (IP)  with  Benders'  decomposition 
3:  Declare  x  variables  as  binary  in  master  problem 
4:  Initialize  the  optimality  cut  function 

5:  Solve  the  integer  master  problem  using  the  optimality  cut  function  to  assert  feasibility  of  solutions 
and  add  optimality  cuts 
6:  return  x*  and  optimal  value 


In  line|4]of  Algorithm[l]we  initialize  any  additional  structures  that  may  be  needed  by  the  optimality  cut 
function  before  invoking  the  solver  in  line  [5]  In  particular,  as  there  may  be  several  solutions  sharing 
the  same  x  subvector,  we  keep  a  list  V  of  first-stage  x  for  which  Q(x)  has  been  computed  to  avoid 
duplicate  evaluations.  In  a  standard  implementation,  the  optimality  cut  function  has  the  form  shown 
in  Algorithm|2] 

The  optimality  cut  function  returns  TRUE  if  the  candidate  integer  solution  is  indeed  feasible  to  (IP). 
Otherwise  it  returns  FALSE  to  reject  the  solution  and  apply  the  optimality  cut.  Note  that  the  steps  in 
lines  |C  and  |C  in  Algorithm  0  are  not  needed  for  convergence  of  the  method,  but  help  to  improve  the 
global  lower  bound  on  Q(x). 

The  optimality  cut  0  relies  on  exact  evaluations  of  Q(x),  which  can  be  very  time-consuming  in  the 
case  where  Q(x)  is  given  by  a  complicated  mixed-integer  program.  Also,  observe  that  0  depends  on 
x*  and  Q(x*)  only,  i.e.,  it  only  depends  on  the  point  to  be  cut-off.  In  particular,  it  does  not  take  into 
account  the  information  provided  by  other  solutions  that  we  may  have  found  while  exploring  the  first- 
stage  set.  To  improve  the  performance  of  the  integer  L-shaped  method,  we  propose  two  approaches  to 
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Algorithm  2  Standard  optimality  cut  function 

Input:  (x*,z*,  8* )  candidate  integer  solution,  Q  :  X  — »  ]R,  Qip  :  X  — »  ]R,  V 
Output:  true  if  solution  is  feasible,  false  otherwise 
l:  if  x*  E  V  then  / /  We  know  8*  >  Q(x*) 

2:  return  true 

3:  end  if 

4:  Compute  Qlp{x*) 

5:  Add  the  subgradient  cut  (|7| 

6:  Compute  Q(x*) 

7:  V  ^Vu{x*j 
8:  if  8*  <  Q(x*)  then 
9:  Add  the  integer  optimality  cut  (J6jl 

10:  return  false 

li:  else 

12:  return  true 

13:  end  if 


deal  with  the  above  issues.  First,  in  Sectionp]  we  present  a  simple  modification  that  alternates  between 
exact  and  approximate  evaluations  of  Q(xy  Then,  in  Section  [3|  we  introduce  of  a  new  type  of  opti¬ 
mality  cut  that  includes  information  obtained  from  different  solutions;  in  particular,  evaluations  and 
estimates  of  Q(x)  at  different  points.  These  new  cuts  are  obtained  through  a  cut-generating  linear  pro¬ 
gram  which  is  constructed  based  on  ideas  from  disjunctive  programming  and  the  forbidden-vertices 
problem  0.  Then,  in  Section  |4j  we  outline  an  implementation  that  combines  both  modifications  in  a 
single  method.  Finally,  in  Section[5j  we  present  computational  results  of  the  proposed  variants  on  two 
classes  of  stochastic  integer  programs. 


2  Alternating  cuts 


In  this  section  we  present  a  simple  cut  strategy  to  decrease  the  overall  effort  incurred  in  computing  the 
function  Q(x). 

Suppose  that  while  solving  (IP)  with  the  integer  L-shaped  method,  a  candidate  solution  (x*,z*,8*) 
has  been  found  along  the  search  tree  of  (MP).  Recall  that  we  reject  the  solution  if  8*  <  Q(x*).  Since 
Qlp(x)  <  Q(x),  a  sufficient  condition  to  reject  ( x*,z*,8 *)  is  8*  <  Qlp(x*).  Given  that  Qlp(x)  is 
convex,  we  have  that  the  subgradient  cut  (pi  is  a  valid  inequality  that  cuts  off  the  pair  (x*,8*)  in 
the  (x,  ff)-space.  Therefore,  instead  of  evaluating  Q(x*)  exactly,  we  first  evaluate  Qi,p(x*  )  and  check 
whether  8*  <  Qlp(x*).  If  so,  we  add  the  subgradient  cut  (Jt|  to  remove  (x*,0':  ).  Otherwise,  we  compute 
Q(x*)  and  check  whether  8*  <  Q(x*).  If  so,  we  add  the  integer  optimality  cut  (JbJ.  Otherwise,  we 
accept  the  solution.  The  key  idea  is  to  use  Qlp{x)  as  a  proxy  for  Q(x)  to  check  feasibility  of  a  candidate 
solution,  preventing  unnecessary,  and  more  costly,  computations  of  Q(x). 

The  modification  just  described  is  similar  in  spirit  to  sequential  approximation  schemes  such  as  jl3 , 
ei,  m,  and  M  ,  where  the  second-stage  cost  function  Q(x)  is  approximated  by  linear  programs  which, 
starting  with  Qlp(x),  are  iteratively  strengthened  with  additional  cuts.  Although  these  methods  are 
shown  to  converge  after  a  finite  number  of  steps,  the  convergence  can  be  very  slow  and  in  practice 
exact  evaluations  of  Q(x)  may  be  required.  In  contrast,  in  the  context  of  the  integer  L-shaped  method, 
we  propose  to  use  Qlp{x)  as  the  unique  intermediate  approximation  for  Q(x),  which  is  a  simple  yet 
useful  modification  whose  implementation  is  rather  straightforward  and,  to  the  best  of  our  knowledge, 
has  not  been  reported  in  the  literature. 
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To  implement  the  approach  presented  above,  in  addition  to  V ,  we  also  keep  a  list  Vpp  of  visited  first- 
stage  solutions  x  for  which  the  continuous  relaxation  Qlp(x)  has  been  computed.  The  modified  strat¬ 
egy,  which  we  call  alternating  cuts,  proceeds  as  shown  in  Algorithm  [3] 


Algorithm  3  Optimality  cut  function  with  alternating  cut  strategy 

Input:  (x*,z*,  9* )  candidate  integer  solution,  Q  :  X  — »  ]R,  Qpp  :  X  — >  1R,  V,  Vpp 
Output:  true  if  solution  is  feasible,  false  otherwise 
l:  if  x*  £  V  then  / /  We  know  9*  >  Q(x*) 

2:  return  true 

3:  end  if 

4:  if  x*  £  Vpp  then 
5:  Compute  Qt p(x*) 

6:  VLP  <-  VLP  U  {x*} 

7:  if  9*  <  Qlp(x*)  then 

8:  Add  the  subgradient  cut  j7|. 

9:  return  false 

10:  end  if 

li:  end  if 

/  /  Now  we  have  x*  £  Vpp  and  6*  >  Qlp{x*) 

12:  Compute  Q(x*). 

13:  V  ^VU{x*j 

14:  if  9*  <  Q(x*)  then 

15:  Add  the  integer  optimality  cut  (j6j 

16:  return  false 

17:  else 

18:  return  true 

19:  end  if 
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Note  that  if  x*  £  Vpp  satisfies  |7|,  then  x*  is  included  into  Vj  p  and  thus  the  steps  in  lines 
Algorithm [3] are  applied  to  check  whether  (x* ,z* ,  9*)  is  a  feasible  solution  or  not.  As  we  shall  see  in 
Section  [5j  this  simple  modification  yields  speedups  of  one  order  of  magnitude  on  instances  from  the 
literature. 


3  New  optimality  cuts 


In  this  section,  we  present  a  new  class  of  integer  optimality  cuts  that  can  be  used  as  an  alternative 
to  the  standard  cut  <[6j.  After  providing  an  overview  of  the  approach,  we  show  how  to  construct  a 
cut-generating  linear  program  to  separate  these  new  inequalities  and  then  we  discuss  some  implemen¬ 
tation  details.  In  this  section  we  denote  conv  (T)  the  convex  hull  of  a  set  T  of  real  vectors. 

Let  S  be  the  projection  of  the  feasible  set  of  (MP)  onto  the  (x,  0)  -space,  which  corresponds  to  the  epi¬ 
graph  of  Q(x)  over  X,  i.e., 

S  =  {(x,0)  eXxR:  9  >  Q(x)}. 

Let  V  C  X  be  such  that  Q(x)  is  known  for  all  x  G  V.  We  have 

S  C  S(X,V)  :=  U(M):  fi>QW}U(X\V)x{»:  9  >  L}. 

xev 


In  some  sense,  S(X,  V)  is  the  best  approximation  of  S  when  only  the  values  of  Q(x)  for  x  £  V  are 

5 

DISTRIBUTION  A:  Distribution  approved  for  public  release. 


known  and  only  the  trivial  lower  bound  L  is  available  over  X  \  V.  We  consider  the  relaxation  S(V)  of 
S(X,V)  given  by 


S(X,V)  C  S(V)  :=  U(M):  fi>QW}U({0,l}"\V)x{0:  6  >  L}. 

xev 


Figure  |l|  illustrates  an  example  with  x  G  {0, 1  }2  and  b  =  j  J  j  ,  ^  ]  j  j.  The  bold  dots  represent  the 

values  of  Q(x)  that  are  known  depending  on  whether  x  belongs  to  b  or  not.  Then  S(V)  is  given  by  the 
union  of  the  vertical  rays. 
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Figure  1:  S(V). 

Observe  that  S(b)  C  S(U)  for  any  U  C  b,  and  in  particular,  S(b)  C  S({x})  for  x  G  V .  Moreover, 

S(b)  —  fhcv  S({x}).  Since  (|6j  is  a  valid  inequality  for  conv  (S({x})),  it  is  also  valid  for  conv  (S(b)). 
Actually,  (J6l  is  the  only  nontrivial  cut  needed  to  describe  conv  ( S  ({  x  } )) .  However,  in  general,  conv  (S(b))  C 
n.tgy  conv  f  S  ( { i' } ) )  holds  with  strict  containment,  i.e.,  adding  ||6l  for  all  x  G  V  does  not  yield  conv  (S(b)j. 
Our  goal  is  to  derive  a  compact  extended  formulation  for  conv  (S(V))  and  use  it  to  generate  optimality 
cuts  for  a  point  ( x*,9 *)  in  the  (x, d) -space  that  take  into  account  the  values  of  Q(x)  for  x  G  V.  Fig¬ 
ure  15]  shows  how  this  information  can  improve  our  approximation  of  the  convex  hull  of  the  epigraph 
of  Q(x). 

Several  steps  of  the  construction  of  our  cut-generating  linear  program  rely  on  Lemma  [l]  below,  which 
follows  from  disjunctive  programming  |j3)  applied  in  the  context  of  linear  extended  formulations  of 
polyhedra. 

Lemma  1.  Let  Pi, . . . ,  be  nonempty  polyhedra  in  IR"  having  the  same  recession  cone.  If  Pi  —  {x  G  1R”  |  G 
Rmi  :  EjX  +  Fiyi  >  hi},  then  conv  ^U^=1P,J  —  {x  G  1R”|  3x,-  G  IR”,  y,  G  IR'"',  A  G  IR,f  :  x  — 

E/=i  xu  Eixi  +  FiVi  >  KK  Eti  A'  =  !/  A  >  °}- 
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A  A 


(a)  The  standard  cut  at  x  =  (1, 1)  (light  gray) 
can  be  strengthened  if  Q(l,  0)  is  known 
(dark  gray). 


(b)  The  description  of  the  solutions  that  are  not  in  V 
given  by  standard  cuts  (gray  triangle)  can  be  made 
exact  (bold  segment). 


Figure  2:  Improving  the  description  of  conv  (S(V)). 


3.1  Construction  of  CGLP 

Clearly,  we  have  conv  (S(V))  =  conv  (Pq(V)  U  Pl(V)),  where 

Pq(V)  :=  conv  (  (J  {(*,0)  :  9  >  Q(x)} 
\xev 


and 

Pl(V’)  :=  conv  ({0,1}"  \  V)  x  {9:  9>L}. 

Thus  to  describe  conv  (Sfl7))  it  suffices  to  provide  compact  extended  formulations  for  Pq(V)  and 
Pl(V)  and  then  apply  disjunctive  programming  to  their  union  as  illustrated  in  Figure [3] 

Describing  Pq(V)  is  trivial:  letting  V  =  {x1, . . .  ,xtj,  then  Pq(V)  is  the  set  of  vectors  (x^,  9®)  £  R”  x  R 
for  which  there  exists  tp  €  Rf  satisfying 


—  X®  +  Y2  (psXS  —  0 

S=1 

-9Q  +  J2^sQ(xs)  <0 

S—l 
= 1 

s—l 

<p>  0. 


To  describe  Pi(V),  it  is  enough  to  describe  conv  ({0, 1}”\V)  and  then  take  the  Cartesian  product  with 
{9  :  9  >  L}.  We  build  on  results  from  the  forbidden-vertices  problem  [21  to  do  this. 
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Figure  3:  conv  (S(V))  —  conv  (Pq(V)  U  Pl{V)). 


Let  V1  be  the  projection  of  V  onto  the  first  i  coordinates.  Define  V1  {0,1}  \  V1,  V1  [ V1  1  x 

{0, 1}]  \  V1  C  {0, 1}'  for  i  >  2,  and  write  V1  —  {v\, . . .  ,»}.}•  Finally,  for  all  i ,  let  W’i  V1  x  {0}1_!  — 

{wf, . . . ,  w‘^  }  C  {0,1  }i  for  all  j  >  i  and  define  W‘  Wm  —  {zv'lr . . . ,  }  C  {0,  l}’1. 


From  0,  for  all  1  <  j  <  n  —  1  we  have 


{o, i y+1  \  v’+1  =  ({o,iy  \  W)  x  {o,  i}  u  v’+1. 


(8) 


The  idea  behind  ^  is  that  any  vector  in  {0, 1  1  \  y/+1  is  such  that  either  its  projection  onto  {0,  l}-' 

does  not  lie  in  V>  or  it  is  obtained  by  flipping  the  value  of  the  last  component  of  a  vector  in  1/1+1 
otherwise. 


Example  2.  Consider  n  =  3  and  1/=|^  l  ^  ^  l  ^  ^  o  ^j.  For  j  —  2,  zve  have  i  o  )} 

and  therefore  {0,  l}2  \^2~{(  o  )'(  l  )}•  Clearly,  any  vector  in  {0,  l}3  zvhose  projection  onto  {0,  l}2  lies 
outside  V 2  must  belong  to  {0,  l}3  \  V.  Hence  [{0,  l}2  \  V 2]  x  {0, 1}  = 

{0,  l}3  \  V.  On  the  other  hand,  we  have  V  C  V2  x  {0, 1}  — 


l/3  =  [V2  x  {0,1}]  \  V  = 


C  {0,  l}3  \  V.  Then  zve  can  verify  that  (S'  holds  for  j  —  2,  i.e., 


{0,  l}3  \  V  -  [({0,  l}2  \  V2)  x  {0, 1}]  U  l/3. 


i)}£ 

,  and  thus 


We  use  the  recursion  ^  to  derive  an  explicit  extended  formulation  for  conv  ({0, 1}"  \  V)  having  0(n\V\) 
variables  and  constraints. 

Proposition  3.  For  all  2  <  j  <  n,  conv  ({0, 1}!  \  W)  is  given  by  all  x  €  W  for  which  there  exist  vectors  y,  A, 
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and  }i  satisfying 


j  h  .  .. 

-^+y  +  EE^!i  =  0 

i=l  1=1 

h 

Lri -a!  =  0 

1=1 

A 

E  V-\  +  A;_  i  —  A,-  =  0  V2  <  i  <  j  —  1 
1=1 

ki  . 

E  A + A/- 1 =  1 

;=i 

3/i  =  0 

i /j  —  A ;■_!  <0  V2  <  i  <  j 
y  >  0,  A  >  0,  y  >  0. 

Proof.  We  apply  induction  on  2  <;<«.  The  base  case  reduces  to  proving  that  conv  ({0,  l}2  \  V2)  is 
given  by 

2  kj 

—x  +  y  +  EE  ilizv\2  —  0 

i=iz=l 

h 

E  ?z  -  At  = 0 

Z=1 

k2 

E/'/+Ai-l  (9) 

1=1 

yi  =  o 

3/2  ~  Ai  <  0 

3/  >  0,  A  >  0,  y  >  0. 


Indeed,  from  §),  we  have 


{0,1}2\  V2 


({0/1}1  \  x  {0,1} 


UF. 


By  definition,  we  have  W12  —  V1  x  {0}  =  ({0, 1}  \  V1)  x  {0}.  Then  observe  that 

({O,!}1^1)  x{0,l}  =  W12  + °Q  ),(  J  )}, 


(10) 


and  thus 

conv  ^  ^{0,  l}1  \  V1)  x  { 0, 1 } ^)  =  conv  ^W12^j  +  jy  £  IR2  :  3/1  =  0,  0  <  1/2  <  1  j . 


Writing  W12  —  {icj2, . . . ,  icj[2},  it  follows  that  conv  ( ({0,  l}1  \  V1)  x  {0, 1})  is  given  by  p  £  IR2  such 
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that 


h 

-p+y  +  E/'W2  = 0 

1=1 

h 

E/‘/= 1 

;=i 

yi  =  o 

y2  <  i 

y  >  0,  fz1  >  0. 

We  also  have  by  definition  V2  —  W22  =  {zr22, . . . ,  zw22},  and  thus  conv  ( V2)  is  given  by  q  £  ]R2  such 
that 

-9  +  E  }llw?  =  0 

1=1 

h 

Li4  =  i 

i=i 

f!2  >  0. 


From  (Ip),  we  apply  Lemma  [T]  to  the  above  polytopes:  we  introduce  a  multiplier  0  <  A  ]  <  1,  we 
include  the  equation  x  —  p  +  i),  and  we  multiply  the  right-hand-side  vectors  of  the  first  and  second 
systems  by  A|  and  1  —  Aj,  respectively  After  eliminating  p  and  q,  we  immediately  obtain  the  desired 
system  §  for  conv  ({0,  l}2  \  V2). 

Now,  assuming  that  the  claim  holds  for  some  2  <  j  <  n  —  1,  we  will  prove  that  it  also  holds  for  j  +  1. 
Since  conv  (({0,  1}/  \  W)  x  {0, 1})  =  conv  (({0,  1}/  \  W))  x  [0, 1],  by  the  inductive  hypothesis,  we 
have  that  conv  (({0,  1}/  \  W)  x  {0, 1})  is  given  by  p  £  IR/+1  satisfying 

-p+y  +  ED'l^r1  =  ° 

i=l/=l 

h 

E/'Z  -At=0 

z=i 

ki 

E  +  Aj_i  —  A,-  =  0  V2  <  z  <  ;  —  1 
1=1 

ki  . 

E  A + A/-i =  1 

i=i 

3/1—0 

3//  —  Aj_x  <0  V2  <  i  <  j 
Vj+i  <  1 

3/  >  0,  A  >  0,  ;z  >  0, 


where  we  have  appended  a  new  variable  0  <  t/y_i  <  1  and  vectors  have  been  extended  to  w  j" 1  by 
appending  another  component  with  value  0. 
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We  also  have  that  conv  (VP+1)  is  given  by  q  £  IR-,+1  satisfying 


ki+i 

E  /<! 
1=1 


1=1 

J+1  - 


F 


ii+1  >  0. 


From  (jSj,  it  is  enough  to  apply  Lemma  |T]  to  the  above  polytopes  to  find  an  extended  formulation  for 
conv  ({0, 1 }  ,+ 1  \  W+1) .  Analogously  to  the  base  case,  we  introduce  a  multiplier  0  <  A;  <  1,  we  include 
the  equation  x  —  p  +  q,  and  we  multiply  the  right-hand-side  vectors  of  the  first  and  second  systems  by 
A j  and  1  —  Ay,  respectively.  After  eliminating  p  and  q,  we  immediately  obtain  the  desired  system  for 
conv  ({0, l}i+1  \  W+1).  □ 


From  Proposition |3j  we  obtain  that  conv  ({0, 1}”  \  V)  is  given  by  the  vectors  xL  £  1R"  such  that 

n  ki 

-xL+y  +  LL  f\w\  = 0 

i=i  i=i 

h 

Ef‘}~  Ai  =  0 

i=i 

h 

yj  p\  +  A;_i  —  —  0  V2  <  i  <  n  —  1 

l=i 

E  y'l  +  ^n~i  =  i 

i=i 

yi  =  o 

i h  —  A;_i  <0  V2  <  i  <  n 
y  >  0,  A  >  0,  p  >  0. 


Appending  the  variable  9L  and  the  constraint  9L  >  L  to  the  above  system  gives  an  extended  formula¬ 
tion  for  Pi(V).  Note  that  excluding  the  nonnegativity  restrictions,  the  constraint  matrix  has  3 n  rows 
and  3n  +  O (n  \  V \ )  columns,  i.e,  only  its  width  changes  with  V.  In  particular,  updating  the  formulation 
can  be  done  columnwise,  which  is  a  desirable  property  from  the  computational  point  of  view. 

Once  again,  we  apply  disjunctive  programming,  but  this  time  to  Pl{V)  and  Pq(V)  to  derive  an  ex¬ 
tended  formulation  for  conv  (S(V)).  Note  that  both  Pi{V)  and  Pq(V)  have  {(0, 9)  £  1R"  x  1R  :  9  >  0} 
as  their  recession  cone  and  thus  LemmaJTIapplies.  We  introduce  a  multiplier  0  <  5  <1,  we  include  the 
equations  x  —  xL  +  and  9  —  9L  +  9® ,  and  we  multiply  the  right-hand-side  vectors  of  the  systems 
defining  Pl{V)  and  Pq(V)  by  5  and  1  —  <5,  respectively. 

Recall  that  in  the  definition  of  S(V)  we  have  dropped  the  dependence  on  X.  To  recover  part  of  that 
information,  we  can  describe  a  polyhedron  that  lies  between  conv  (S)  and  conv  (S(V)).  For  that,  P;  ( V ) 
can  be  coupled  with  any  valid  inequality  for  (MP).  In  particular,  including  variables  z  >  0  and  the 
system  AxL  +  Cz  <  b  tightens  the  formulation.  Lower  bounds  of  the  form  rir1  —  1 9L  <  ttq  can  be 
useful  too  to  better  approximate  the  shape  of  the  epigraph  S  of  Q(x).  Thus  we  may  assume  that  both 
types  of  constraints  are  added  to  the  formulation  of  Pi(V),  and  that  9L  >  L  is  absorbed  in  TIxL  —  19L  < 

/ro¬ 
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Finally,  we  obtain  that  if  (x*  ,6*)  does  not  belong  to  conv  (S(V)),  and  thus  not  to  conv  (S),  then  the 
following  system  is  infeasible: 


(a) 

xL  +  X®  =  X* 

(|6) 

eL  +  eQ  =  e* 

n  k\ 

(?) 

-^+y  +  EE^h° 

i= i  i=i 

h 

(Pi) 

Ai=0 

i=i 

kt 

E  p\  +  A i_\  —  \i  —  0  V2  <  i  <  n 

l=i 

kn 

(Pi) 

( Pn ) 

E  F”  +  ^n-l  —  ^  —  0 

1=1 

(<Pl) 

yi  =  o 

( Vi ) 

y;  —  A;_i  <0  V2  <  i  <  n 

w 

Tlx1  —  1  6l  —  tzoS  <  0 

(v) 

+  Cz  —  W  <  0 

(7) 

t 

—x®  -|_  E  <^sxs  —  o 

S=1 

(T) 

-0Q  +  Z>SQ(*«)  <  0 

S=1 

Efc  +  *  =  1 

S=1 

y  >  o,  a  >  o,  p  >  o 

<p  >  0 
<5  >  0. 

(v) 

By  Farkas'  Lemma,  and  after  removing  redundancies,  we  arrive  at  the  alternative  system 

x*u  +  e*p  +  ri  <  0 
a  —  a  +  Arv  +  nTi/>  =  0 
/3  —  lip  =  0 

-| On  +  //  -  bv  -  riQip  >  0 

CTv  >  0 

Ci  +  <pi  >  0  2  <  i  <  n 

~Pi  +  Pi+i  +  <Pi+ 1  >  0  1  <  i  <  n  -  1 

w\a  +  pi  >  0  1  <  n,  1  <  1  <  ki 

xsa  +  Q{xs)p  +  i ]  >  0  l<s  <t 

P  >  0,  cp  >  0,  v  >  0,  ip  >  0. 

Thus,  any  feasible  solution  to  the  above  system  yields  an  inequality  ax  +  [56  >  —t]  that  is  valid  for 
conv  (S),  but  is  violated  by  ( x*,6 *). 
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For  finite  termination  of  the  integer  L-shaped  method,  we  need  a  tightness  condition  at  the  current 
solution,  namely  ocx*  +  f}Q(x*)  —  —rj.  Including  this  condition  yields  0  >  x*oc  +  9*f$  +  rj  —  x*oc  + 
[5Q(x*)  + 1]  —  fSQ(x*)  +  0*j6  —  p(9*  —  Q(x*)).  Since  9*  <  Q(x*),  we  conclude  that  fi  >  0  in  any  feasible 
tight  solution.  Therefore,  we  replace  the  condition  x*oc  +  9*  ft  +  rj  <  0  with  x*oc  +  Q(x*)fS  +  rj  —  0  and 
the  normalization  /3  =  1.  Note  that  the  objective  function  of  the  resulting  linear  program  is  fixed  to 
zero,  and  we  only  need  to  find  a  feasible  solution,  which  always  exists  by  definition  of  the  system;  in 
particular,  ([6])  is  feasible.  The  final  system,  denoted  CGLP,  reads 


oc  —  u  +  Ar  v  +  n  Tip 

= 

0 

w 

= 

1 

-pn  +rj  —  bv  —  7r0tp 

> 

0 

CTv 

> 

0 

G  +  (pi 

> 

0 

2  <  i  <  n 

Pi  +  Pi+ 1  +  (pi- 1-1 

> 

0 

1  <  i  <  n  -  1 

iv\cr  +  pi 

> 

0 

1  <  i  <  n,  1  <  1  <  ki 

(11) 

xsx  +  Q{xs)  +  1] 

> 

0 

1  <  s  <t 

(12) 

xtoc  +  Q(x‘ )  + 1] 

0 

(13) 

(p  >  0,  v  >  0,  ip  >  0. 

Having  set  x f  x* ,  we  solve  CGLP  to  find  a  feasible  solution  to  the  system.  In  particular,  we  obtain  a. 

and  rj  defining  a  CGLP-based  optimality  cut  of  the  form 

ocx  +  9  >  —  /;  (14) 


which  by  construction  cuts  off  (x*,6*)  with  9*  <  Q(x*). 


3.2  Implementation 


The  main  difference  that  we  are  proposing  with  the  standard  implementation  is  the  use  of  the  CGLP- 
based  cut  |l4j  in  place  of  (|6j.  This  requires  keeping  a  list  V  of  first-stage  solutions  for  which  Q(x)  has 
been  computed  and  updating  CGLP  accordingly.  Algorithm [4] shows  the  procedure. 


A  key  step  is  found  in  line  [7]  of  Algorithm  [4]  as  conv  (S(V))  has  to  be  recomputed  whenever  a  new 
vector  x*  is  added  to  V.  Of  course,  we  could  derive  CGLP  from  scratch  every  time.  Doing  so  requires 
computing  the  sets  Wl  and  thus  creating  O  (n\V\)  constraints  in  (11 1.  Instead,  we  propose  to  perform 
marginal  updates  from  an  iteration  to  the  next  one  using  the  fact  that  Wl  —  V'  X 


Let  V(  =  {x1, . . . ,  xf }  be  the  set  of  the  first  t  solutions  found  along  the  master  tree.  Similarly,  let  Vlt  be  the 
projection  of  Vt  onto  the  first  i  components  and  set  Vtl  :=  [V/-1  x  {0, 1}]  \  V't  with  l/fx  :=  {0, 1}  \  V}. 
Suppose  a  new  vector  xt+1  —  (x\ , . . .  ,xn)  is  to  be  included  and  let  Vt+\,  Vf+1,  Vf'+1  be  the  updated 
sets.  Let  x1  (x\, . . . ,  Xj)  and  xl  (x\, . . .  ,x,-_i,l  —  Xi).  Clearly,  we  have  Vf+1  —  VtU{xt+1} 
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Algorithm  4  Optimality  cut  function  with  CGLP-based  optimality  cuts 

Input:  (x*,z*,  8* )  candidate  integer  solution,  Q  :  X  — »  ]R,  Qip  :  X  — »  ]R,  V 
Output:  true  if  solution  is  feasible,  false  otherwise 
l:  if  x*  E  V  then  / /  We  know  8*  >  Q(x*) 

2:  return  true 

3:  end  if 

4:  Compute  Qlp{x*) 

5:  Add  the  subgradient  cut  (|7| 

6:  Compute  Q(x*) 

7:  Update  CGLP. 

8:  V  <-  VU{x*} 

9:  if  8*  <  Q(x*)  then 
10:  Solve  CGLP  to  obtain  a  and  tj 

11:  Add  the  integer  optimality  cut  (14) 

12:  return  false 

13:  else 

14:  return  true 

15:  end  if 


and  Vf'+1  =  V\  U  {x1}.  Now,  to  obtain  Vlt+v  observe  that 

Vm  x{0,l}  1\V/+1 


V?+1 


y/”1  X  {0,1}  u  {x!',x!}l  \  \vi  u  {?} 
v;-1  x  {0, 1}  U  {xz}l  \  \vl  U{xf} 

( [v/-1  x  {0, 1}]  \  [vj  U  {x;}] )  U  ({x‘}  \  [vi  U  {x'}] ) 
(Vl\{xi})u({xi}\  Vl 


Therefore,  if  x1  £  V}  anc^  i  V} ,  then  xl  is  included  in  V},  v  Also,  if  x'  G  V},  then  x!  is  removed  to 
obtain  .  Further  observe  that  both  operations  cannot  occur  at  the  same  iteration  since  the  equiva¬ 
lence 

xf  e  v;  x}  e  v;  a  xl  i  vj 

implies  that  x‘  (f_  V{  and  xl  G  V}  cannot  hold  true  at  the  same  time. 

It  follows  that  updating  V  involves  adding  or  removing  at  most  one  vector  for  each  Wl,  totaling  at 
most  n  such  operations.  The  system  CGLP  is  updated  accordingly  by  appending  or  dropping  at  most 
n  rows  in  (|TT|.  Also,  xt+1  takes  the  place  of  xf  in  1 13 1  and  the  cut  corresponding  to  xf  now  takes  the 
form  (12)  by  changing  the  equality  sign  into  inequality.  The  procedure  to  update  CGLP  is  shown  in 
Algorithm  [5] 


4  Combined  method 


Now  we  outline  an  implementation  of  the  integer  L-shaped  method  that  combines  the  alternating 
strategy  discussed  in  Section |2] with  the  new  optimality  cuts  presented  in  Section[3] 
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Algorithm  5  Updating  CGLP 

Input:  CGLP,  V',  V‘,  t,  xt+1  —  {x\, . . .  ,xn) 

Output:  Updated  CGLP,  V1,  V' 

l 

for  1  <  i  <  n  do 

2 

x‘  (xi,...,Xj_i,X,-) 

3 

x'  <T-  (xi,  .  .  . ,  X;_i,  1  ~  Xj) 

4 

if  x1  £  V1  and  xl  £  V'  then 

5 

iv  <—  x‘  x  {0}”-! 

6 

Add  wi t  +  pi  >  0  to  (IT) 

7 

V1  <-  V1  U  {x1} 

8 

end  if 

9 

if  x1  G  V1  then 

10 

w  <—  xl  x  {0}"-' 

11 

Remove  zvj  +  p;  >  0  from  (Tl) 

12 

V'  <-  V1  \  {x'j 

13 

end  if 

14 

V ’  <-  V1  U  {x'j 

15 

end  for 

16 

17 

Add  x{x  +  Qjx*)  +  ij  >  0  to  1 
Replace  (fl3)  with  xt+1a  +  Q( 

UZ 

+ 

II 

o 

We  keep  two  disjoint  lists  of  first-stage  solutions:  in  Vpp  we  include  solutions  for  which  only  Qlp{x) 
has  been  computed,  while  in  V  we  keep  solutions  for  which  Q(x)  has  been  evaluated.  At  any  given 
stage,  we  assume  that  for  each  x  €  V  we  have  added  an  optimality  cut  that  is  tight  at  x.  Now,  when 
a  candidate  integer  solution  ( x*,z*,8 *)  is  found  in  the  master  tree,  we  check  whether  x*  £  V  or  not. 
If  so,  we  accept  the  solution  as  we  already  know  Q(x*)  <  8*.  Now,  if  x*  £  Vpp,  then  we  compute 
Qlp(x*),  we  add  x*  into  Vlp,  and  in  case  8  <  Qlp(x*),  we  add  the  subgradient  cut  (|7j.  At  this  point,  if 
(x*,  8* )  has  been  neither  accepted  nor  rejected,  we  have  x*  e  Vpp  and  8*  >  Qlp(x*).  Thus  we  compute 
Q(x*),  we  move  x*  from  Vpp  to  V,  and  in  case  8  <  Q(x*),  we  add  the  CGLP-based  cut  (li)  and  accept 
the  solution  otherwise.  Algorithm  |6]below  presents  the  method. 


5  Results 


In  this  section  we  address  the  performance  of  the  variants  of  the  integer  L-shaped  method  discussed 
so  far.  Given  that  the  implementations  differ  in  the  cut  strategy  used  and  in  the  type  of  optimality  cut 
added,  we  consider  the  following  combinations: 

1.  Std-Std:  standard  cut  strategy  and  standard  optimality  cut  (J6J;  see  Section  [l] 

2.  Alt-Std:  alternating  cut  strategy  and  standard  optimality  cut  (6);  see  Section]?] 

3.  Std-CGLP:  standard  cut  strategy  and  new  optimality  cut  (l4j;  see  Section]?] 

4.  Alt-CGLP:  alternating  cut  strategy  and  new  optimality  cut  (14);  see  Section]?] 


In  other  words,  Std-Std  corresponds  to  the  usual  implementation  of  the  integer  L-shaped  method,  on 
top  of  which  the  variants  are  built. 


Our  computational  implementation  uses  CPLEX  12.5.0.1  as  a  solver  and  its  Callable  Library  for  ad¬ 
vanced  control  routines.  Since  either  optimality  cuts  (6)  or  1 14 1  are  part  of  the  complete  formulation 
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Algorithm  6  Optimality  cut  function  with  alternating  cut  strategy  and  CGLP-based  optimality  cuts 

Input:  (x*,z*,  8* )  candidate  integer  solution,  Q  :  X  — »  ]R,  Qlp  :  X  — >  ]R,  V,  Vlp 
Output:  true  if  solution  is  feasible,  false  otherwise 
l:  if  x*  G  V  then  / /  We  know  8*  >  Q(x*) 

2:  return  true 

3:  end  if 

4:  if  x*  i  Vpp  then 
5:  Compute  Qt p(x*). 

6:  VLP^VLPU{X*}. 

7:  if  8*  <  Qlp(x*)  then 

8:  Add  the  subgradient  cut  j7|. 

9:  return  false 

10:  end  if 

li:  end  if 

/  /  Now  we  have  x*  £  Vlp  and  8*  >  Qlp{x*) 

12:  Compute  Q(x*). 

13:  Update  CGLP. 

14:  V  <~VU{X*}. 

15:  Vlp  <—  Vlp  \  {x*} 

16:  if  8*  <  Q(x*)  then 

17:  Solve  CGLP  to  obtain  a  and  t] 

18:  Add  the  integer  optimality  cut  (14} 

19:  return  false 

20:  else 

21:  return  true 

22:  end  if 


(MP)  but  not  included  from  the  beginning,  we  have  to  add  them  on-the-fly  through  the  optimality  cut 
function.  This  routine  is  called  every  time  the  solver  finds  a  candidate  integer  solution  to  the  master 
problem  and  is  in  charge  of  generating  an  optimality  cut  if  needed.  In  the  case  of  CGLP,  it  calls  addi¬ 
tional  subroutines  to  make  the  required  updates  to  generate  (14).  We  include  the  formulation  of  the 
first-stage  set  in  CGLP,  along  the  subgradients  cuts  derived  from  the  linear  relaxation  of  Q(x)  used  in 
Benders'  decomposition. 

The  experiments  were  carried  out  on  a  personal  computer  with  3.33  Ghz  CPU,  4  Gb  of  RAM,  and 
running  Linux.  A  relative  optimality  gap  of  0.01%  was  set  as  stopping  criterion  and  a  time  limit  of 
7200  seconds  was  imposed.  We  do  not  report  on  the  extensive  form  of  the  instances  as  solving  them 
using  an  off-the-shelf  solver  is  much  slower  than  the  decomposition  approaches. 


5.1  Stochastic  server  location  problem 

The  stochastic  server  location  problem  is  described  in  [12].  Given  n  locations,  in  the  first  stage  we  are 
asked  to  decide  where  to  place  servers  so  that  the  demand  given  by  m  potential  customers  is  satisfied 
in  the  second  stage.  The  uncertain  data  is  the  set  of  customers  to  be  served  in  the  second  stage  and  the 
objective  is  to  maximize  the  expected  second-stage  revenue  minus  the  first-stage  installation  costs.  In 
minimization  form,  the  problem  can  be  written  as 

min  cx  +  Q(x) 
s.t.  x£{0,l}n, 
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where  Q(x)  :=  E?[Qf(x)]  and 


QfOO  :=  min  c\xy  +  q2s 

s.t.  W1y  +  W2s>h{£)-Tx 
y  e  {0,1}"'” 

S  e  R". 


The  random  right-hand-side  vector  /i(£)  represents  the  set  of  active  customers  in  a  given  scenario. 

We  tested  our  methods  on  the  instances  presented  in  11131.  Instances  SSLP.n.m.k  have  n  locations,  m 
customers,  and  k  scenarios,  leading  to  n  binary  variables  in  the  first  stage  and  nm  binary  variables  and 
n  nonnegative  variables  per  scenario  in  the  second  stage.  For  each  n  and  m,  five  replications  with  k 
scenarios  each  are  considered.  We  did  not  include  instances  having  n  —  5  as  all  of  them  took  less  than 
1  second  to  solve  with  any  method. 

Tables  [l] and  [2] summarize  our  results.  In  both  tables,  column  Instance  indicates  the  combination  of  n, 
m  and  k  as  above.  Headers  Std-Std,  Alt-Std,  Std-CGLP,  Alt-CGLP  denote  the  type  of  implementation 
under  consideration.  Here  we  present  the  averages  over  the  five  replications  of  each  instance.  Detailed 
results  are  given  in  Tables[6]and[7]in  the  Appendix. 

In  Table[l]we  present  the  overall  results  for  all  four  methods.  Columns  Nodes  show  the  average  number 
of  nodes  explored  in  the  master  problem.  Columns  Time  show  the  average  total  time  spent  to  reach 
optimality,  which  includes  computing  an  initial  lower  bound  L,  solving  the  LP  relaxation  with  Benders' 
decomposition,  and  exploring  and  evaluating  candidate  solutions  in  the  master  problem. 

From  Table  [lj  we  see  that  there  is  no  significant  variation  in  the  number  of  explored  nodes  among  the 
different  methods.  Now,  the  implementations  that  use  the  alternating  cut  strategy  clearly  outperform 
the  other  two  methods,  with  speedups  of  one  order  of  magnitude.  On  the  other  hand,  with  a  few 
exceptions,  the  use  of  CGLP-based  cuts  does  not  cause  major  changes  in  the  total  running  time,  espe¬ 
cially  when  combined  with  the  alternating  cut  strategy.  This  can  be  explained  by  the  fact  that  in  these 
problems,  the  first-stage  is  very  simple  as  X  =  {0, 1}”  with  n  <  15,  which  does  not  present  a  challenge 
for  CPLEX. 


Instance 


SSLP.10.50.50 

SSLP.10.50.100 

SSLP.10.50.500 

SSLP.10.50.1000 

SSLP.10.50.2000 

SSLP.15.45.5 

SSLP.15.45.10 

SSLP.15.45.15 

SSLP.15.45.20 

SSLP.15.45.25 


Std-Std 

Nodes  Time 


402.4 

70.9 

370.2 

91.1 

381.0 

548.5 

360.0 

1294.1 

392.2 

3298.0 

772.6 

81.5 

1408.0 

400.9 

1500.0 

534.3 

495.6 

358.4 

733.0 

708.4 

Std-CGLP 
Nodes  Time 


394.8 

71.5 

373.0 

90.5 

385.0 

561.7 

357.8 

1307.1 

371.4 

3160.7 

750.2 

89.0 

1370.8 

353.6 

1498.4 

539.1 

481.4 

347.8 

698.8 

704.9 

Alt-Std 

Nodes  Time 


406.8 

6.8 

371.8 

13.2 

386.8 

64.0 

367.4 

128.2 

404.4 

339.3 

763.4 

2.7 

1450.8 

6.1 

1526.0 

11.7 

500.4 

8.0 

737.8 

16.7 

Alt-CGLP 
Nodes  Time 


404.2 

6.8 

371.0 

13.6 

385.0 

65.5 

368.2 

129.3 

404.6 

336.7 

764.6 

2.8 

1414.0 

6.5 

1523.6 

11.9 

502.4 

8.1 

732.2 

17.4 

Table  1:  Stochastic  server  location:  overall  results. 


To  understand  the  effect  of  alternating  cuts,  in  Table[2]we  present  details  regarding  subproblems.  Recall 
that  every  time  a  candidate  integer  solution  is  found,  we  have  to  check  whether  it  is  feasible,  by  either 
solving  a  series  of  MIPs  or  LPs,  one  per  scenario,  and  then  add  a  cut  to  discard  the  solution  if  necessary. 
Headers  #LP  and  #MIP  denote  the  average  number  of  times  a  candidate  solution  was  checked  using 
linear  or  mixed-integer  subproblems,  while  headers  Time  LP  and  Time  MIP  indicate  the  average  time 
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spent  in  each  case.  We  focus  only  on  the  implementations  Std-Std  and  Alt-Std  as  the  comparison  for 
the  remaining  pair  is  similar. 

From  Table  [2j  we  see  that  with  the  alternating  cut  strategy  the  number  of  MIP  evaluations  reduces 
considerably.  This  means  that  in  the  problems  we  tested,  most  of  the  time  it  is  not  necessary  to  com¬ 
pute  the  exact  second-stage  value  of  a  given  first-stage  solution  to  reject  it.  Furthermore,  only  a  small 
fraction  of  these  solutions  are  visited  twice,  and  only  in  those  cases  we  have  to  solve  MIP  subproblems. 
The  benefits  are  evident. 


Instance 

#LP 

#MIP 

Std-Std 
Time  LP 

Time  MIP 

#LP 

#MIP 

Alt-Std 
Time  LP 

Time  MIP 

SSLP.10.50.50 

147.6 

147.6 

1.8 

65.8 

148.6 

3.4 

1.7 

1.8 

SSLP.10.50.100 

131.6 

131.6 

3.3 

81.0 

130.8 

3.8 

3.0 

3.5 

SSLP.10. 50.500 

131.6 

131.6 

16.4 

497.2 

130.6 

3.0 

14.8 

15.2 

SSLP.10.50.1000 

132.0 

132.0 

33.6 

1193.3 

127.2 

3.0 

30.2 

32.8 

SSLP.10. 50.2000 

142.6 

142.6 

72.4 

3082.5 

143.4 

4.2 

67.3 

133.2 

SSLP.15.45.5 

143.0 

143.0 

0.3 

80.6 

143.2 

5.8 

0.3 

1.9 

SSLP.15.45.10 

262.0 

262.0 

1.1 

398.2 

268.5 

5.3 

1.1 

3.6 

SSLP.15.45.15 

310.6 

310.6 

1.9 

530.1 

317.4 

6.0 

1.9 

7.9 

SSLP.15.45.20 

99.4 

99.4 

0.7 

356.1 

98.4 

3.2 

0.7 

5.9 

SSLP.15.45.25 

162.4 

162.4 

1.5 

704.3 

163.0 

5.4 

1.4 

12.8 

Table  2:  Stochastic  server  location:  subproblems  details. 


5.2  Stochastic  multiple  binary  knapsack  problem 

The  second  benchmark  set  corresponds  to  a  class  of  stochastic  multiple  binary  knapsack  problems. 
They  have  the  form 


min  cx  +  dz  +  Q(x ) 
s.t.  Ax  +  Cz  >  b 

x  e  {0,1  }n 
ze  {0,1}”, 

where  Q(x)  :=  E?[Q?(x)], 

QfM  :=  min  qiS)y 

s.t.  Wy  >  li  —  Tx 
V  e  {0/ 1}"/ 

and  all  data  are  nonnegative  integers.  In  the  second-stage  problem,  only  the  objective  vector  y(f)  is 
random,  following  a  discrete  distribution  with  finitely  many  scenarios. 

We  generated  30  instances  of  the  above  problem  with  n  —  120  and  20  equiprobable  scenarios.  The 
systems  Ax  +  Cz  >  b  and  Wy  >  h  —  Tx  have  50  and  5  rows,  respectively.  The  entries  of  A,  C,  T,  W,  c, 
d,  and  q  are  i.i.d.  sampled  from  the  uniform  distribution  over  {1, . . . ,  100}.  We  set  b  —  |  (A1  +  Cl)  and 
h  —  |(T1  +  Wl),  with  1  denoting  the  n-d  imensional  vector  of  ones. 

We  divided  the  instances  intro  three  groups  depending  on  how  much  time  the  standard  implemen¬ 
tation  took  to  solve  each  of  them:  Easy  (less  than  200  seconds,  instances  1-6),  Medium  (between  200 
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and  1000  seconds,  instances  7-18),  and  Hard  (more  than  1000  seconds,  instances  19-29).  We  ommitted 
instance  30  since  none  of  the  methods  was  able  to  solve  it  to  optimality  within  the  time  limit. 


Tables  [3|  |4j  and  [5]  below  summarize  the  results.  Column  Difficulty  denotes  the  instance  class.  The 
remaining  headers  and  columns  are  as  in  Tables  [l]  and  [2]  Detailed  results  are  given  in  Tables  |8j|9j  and 
TO  in  the  Appendix. 


Difficulty 

Std-Std 

Nodes 

Time 

Std-CGLP 

Nodes  Time 

Alt-Std 

Nodes 

Time 

Alt-CGLP 

Nodes  Time 

Easy 

151531.5 

87.1 

127696.0 

82.5 

154611.7 

86.0 

133724.2 

82.8 

Medium 

945487.8 

520.8 

714822.5 

453.8 

940249.3 

516.1 

748502.7 

446.7 

Hard 

3356158.1 

2125.7 

2654448.1 

1833.6 

3371088.5 

2065.2 

2656526.5 

1756.3 

Table  3:  Stochastic  multiple  knapsack:  overall  results. 


From  Table[3j  we  see  that  the  application  of  the  alternating  cut  strategy  does  not  yield  the  time  savings 
we  saw  with  the  stochastic  server  location  problems.  On  the  other  hand,  in  most  instances,  adding 
CGLP-based  cuts  instead  of  standard  cuts  yields  reductions  in  both  the  number  of  nodes  and  the  total 
time,  regardless  of  the  cut  strategy  being  used.  We  would  like  to  conclude  that  these  improvements 
are  due  to  the  fact  that  CGLP-based  cuts  help  to  explore  the  master  tree.  However,  at  this  point,  that  is 
not  completely  clear,  as  for  example,  time  reductions  could  be  consequence  of  less  evaluations  of  Q(x) 
and  not  because  of  the  strength  of  the  new  cuts. 

To  aid  our  analysis,  in  Table|4]we  report  the  average  number  of  candidate  solutions  for  which  Qip(x) 
and  Q(x)  were  evaluated  and  the  average  time  spent  doing  so.  This  time  we  compare  Std-Std  and 
Std-CGLP,  and  the  notation  is  similar  to  that  of  Table[2] 


Difficulty 

#LP 

#MIP 

Std-Std 
Time  LP 

Time  MIP 

#LP 

#MIP 

Std-CGLP 
Time  LP 

Time  MIP 

Easy 

13.7 

13.7 

0.0 

25.7 

14.7 

14.7 

0.0 

28.1 

Medium 

49.2 

49.2 

0.1 

99.1 

54.1 

54.1 

0.1 

107.2 

Hard 

112.3 

112.3 

0.2 

231.6 

114.9 

114.9 

0.2 

235.8 

Table  4:  Stochastic  multiple  knapsack:  subproblems  details. 


We  observe  that  both  implementations  require  roughly  the  same  number  of  evaluations  of  both  Qlp{x) 
and  Q(x),  which  explains  why  alternating  cuts  does  not  outperform  the  standard  cut  strategy.  More¬ 
over,  the  difference  in  the  time  solving  subproblems  is  very  small  compared  to  the  total  running  times 
presented  in  Table  [3]  Thus,  the  reductions  observed  in  Table  [3]  can  be  attributed  to  the  better  approxi¬ 
mation  of  the  first-stage  set  given  by  the  CGLP-based  cuts  and  not  to  the  variability  of  the  evaluations. 
In  this  regard,  it  is  important  to  stress  that,  in  principle,  having  a  better  description  of  the  first-stage  set 
does  not  have  a  direct  relationship  with  the  number  of  candidates  solutions  found  in  the  master  tree, 
and  actually,  having  more  candidates  could  hurt  the  total  running  time  if  their  evaluation  is  too  costly. 
However,  in  situations  where  after  decomposing  the  problem  the  burden  of  the  computation  lies  on 
the  master  problem,  our  improved  cuts  may  prove  beneficial  as  exemplified  by  our  results. 

Finally,  in  Table  [5]  we  present  the  overhead  incurred  by  using  CGLP  to  generate  cuts,  that  is,  the  time 
spent  in  additional  operations  to  maintain  and  solve  CGLP  through  the  method.  For  each  class,  column 
|V|  shows  the  average  final  size  of  V ,  which  is  the  number  of  candidate  solutions  for  which  Q(x) 
was  evaluated  exactly.  Headers  Update  and  Generate  denote  the  average  total  time  spent  updating 
the  formulation  of  CGLP  and  actually  solving  the  system  to  find  an  optimality  cut,  respectively.  This 
additional  time  is  already  included  in  the  total  running  time  presented  in  Table[3] 

As  expected,  the  overhead  increases  as  more  solutions  are  included  in  the  extended  formulation.  Up¬ 
dating  CGLP  takes  practically  no  time,  whereas  generating  the  cut  takes  a  nonnegligible  amount  of 
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Difficulty 

|V| 

Update 

Generate 

Easy 

14.7 

0.0 

0.5 

Medium 

54.1 

0.2 

7.2 

Hard 

114.9 

0.4 

24.7 

Table  5:  Stochastic  multiple  knapsack:  CGLP  overhead. 


time.  However,  compared  to  the  total  running  time,  the  overhead  is  very  small  and  the  effort  of  com¬ 
puting  improved  cuts  pays  off  as  shown  in  Table  [3]  For  more  complicated  problems  where  the  number 
of  binary  first-stage  variables  is  too  large  or  where  too  many  candidate  solutions  are  evaluated,  the  cost 
of  maintaining  CGLP  is  likely  to  be  higher.  In  those  cases,  we  can  enforce  rules  to  limit  the  number  of 
calls  to  CGLP,  such  as  using  the  standard  optimality  cuts  as  a  baseline  and  applying  the  improved  cuts 
only  once  in  a  while. 


6  Concluding  remarks 


In  this  work,  we  have  presented  two  modifications  to  the  integer  L-shaped  method  with  the  objective 
of  reducing  the  running  time  of  the  algorithm.  The  first  one,  termed  alternating  cuts  strategy,  seeks 
to  avoid  expensive  evaluations  of  the  second-stage  cost  function,  while  the  second,  the  use  of  CGLP- 
based  optimality  cuts,  helps  to  better  approximate  the  shape  of  the  epigraph  of  the  cost  function  when 
evaluations  at  different  points  are  available.  Our  computational  results  suggest  the  following: 

1.  The  alternating  cuts  strategy  works  better  in  problems  where  the  computational  bottleneck  of 
(IP)  is  in  evaluating  Q(x).  Even  when  that  is  not  the  case,  this  modification  does  not  seem  to  hurt 
the  total  running  times  and  thus  it  could  be  considered  as  the  base  method  on  top  of  which  more 
evolved  algorithms  can  be  built. 

2.  CGLP-based  cuts  are  a  viable  alternative  when  the  first-stage  set  is  difficult  to  explore  and  com¬ 
puting  Q(x)  is  a  relatively  cheap  operation.  As  the  sole  purpose  of  these  new  cuts  is  to  have  a 
better  representation  of  the  epigraph  of  the  second-stage  cost  function  within  the  master  problem, 
there  is  no  guarantee  about  the  number  or  the  sequence  of  solutions  for  which  Q(x)  is  evaluated, 
and  thus,  in  general,  this  method  performs  well  when  the  impact  of  this  variability  is  small  com¬ 
pared  with  the  effort  of  solving  the  master  problem. 

3.  We  also  point  out  that  our  overall  computational  experience  indicates  that  CGLP-based  cuts  are 
particularly  suitable  for  problems  having  additional  integer  variables  in  the  set  Z,  since  a  deep 
cut  discarding  a  point  (x*,G*)  in  the  (x,  0) -space  may  also  prove  effective  in  discarding  a  large 
number  of  points  of  the  form  (x*,z,  6* )  for  z  G  Z. 

4.  As  favorable  conditions  for  both  modifications  are  unlikely  to  be  attained  at  the  same  time,  we 
observe  that  time  reductions  in  a  combined  method  are  mainly  consequence  of  one  strategy  or 
the  other,  but  not  because  of  the  combination  of  both.  That  being  said,  it  would  be  interesting  to 
experiment  with  implementations  where  CGLP  also  incorporates  approximations  of  Q(x)  such  as 
subgradient  cuts  or  ad-hoc  lower  bounds  rather  than  exact  evaluations  only.  That  would  require 
also  keeping  track  of  firt-stage  vectors  x  for  which  estimates  of  Q(x)  have  been  computed. 

5.  Finally,  in  more  general  settings  where  Q(x)  is  an  easy-to-evaluate  nonconvex  function  for  which 
a  tractable  convex  underestimator  is  not  available,  CGLP-based  cuts  may  prove  helpful  in  solving 
problems  having  the  form  (IP).  Situations  where  Q(x)  is  given  by  black-box  computations  remain 
a  case  study  to  be  explored. 
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Appendix 


Stochastic  server  location  problem 


Instance 

Rep. 

Std-Std 

Nodes  Time 

Std-CGLP 
Nodes  Time 

Alt-Std 

Nodes  Time 

Alt-CGLP 
Nodes  Time 

a 

478 

61.1 

485 

67.8 

468 

7.0 

466 

7.1 

b 

452 

91.3 

434 

85.5 

472 

6.7 

466 

6.7 

SSLP.10.50.50 

c 

300 

79.3 

297 

80.0 

303 

7.0 

298 

7.1 

d 

237 

25.3 

224 

26.9 

230 

5.0 

230 

5.0 

e 

545 

97.5 

534 

97.2 

561 

8.2 

561 

8.3 

a 

452 

109.7 

434 

106.2 

462 

17.1 

470 

18.3 

b 

497 

80.3 

494 

83.7 

493 

11.0 

493 

11.1 

SSLP.10.50.100 

c 

313 

95.3 

291 

98.0 

302 

12.8 

289 

13.6 

d 

216 

49.5 

224 

43.6 

229 

10.7 

229 

10.5 

e 

373 

120.5 

422 

121.0 

373 

14.2 

374 

14.4 

a 

466 

605.5 

470 

643.9 

472 

63.3 

476 

63.5 

b 

441 

482.6 

447 

492.9 

449 

57.7 

449 

57.8 

SSLP.10.50.500 

c 

277 

571.7 

292 

557.6 

275 

64.0 

271 

64.8 

d 

235 

348.8 

239 

353.5 

247 

57.5 

247 

57.6 

e 

486 

733.8 

477 

760.8 

491 

77.5 

482 

84.0 

a 

481 

1542.1 

473 

1549.6 

486 

134.5 

487 

135.4 

b 

473 

1128.7 

477 

1142.2 

460 

114.5 

466 

116.8 

SSLP.10.50.1000 

c 

276 

1509.3 

261 

1509.7 

282 

124.2 

279 

125.5 

d 

225 

752.8 

227 

782.2 

229 

113.2 

229 

113.7 

e 

345 

1537.6 

351 

1551.8 

380 

154.4 

380 

155.3 

a 

466 

3777.1 

467 

3769.2 

472 

382.7 

478 

373.2 

b 

472 

2565.3 

471 

2751.8 

483 

246.7 

478 

251.0 

SSLP.10.50.2000 

c 

286 

3189.4 

286 

3158.8 

302 

368.9 

300 

360.5 

d 

219 

1937.1 

219 

1994.5 

223 

249.0 

225 

249.4 

e 

518 

5021.2 

414 

4129.2 

542 

449.4 

542 

449.6 

a 

230 

11.3 

233 

11.6 

244 

0.7 

244 

0.7 

b 

261 

2.9 

262 

3.0 

270 

0.5 

262 

0.5 

SSLP.15.45.5 

c 

2364 

320.9 

2288 

354.9 

2298 

9.9 

2294 

10.2 

d 

870 

56.2 

826 

58.7 

872 

1.4 

888 

1.7 

e 

138 

16.4 

142 

16.8 

133 

1.0 

135 

1.0 

a 

430 

79.0 

442 

80.1 

429 

2.7 

428 

2.8 

b 

284 

189.0 

251 

190.9 

256 

6.2 

278 

7.6 

SSLP.15.45.10 

c 

2384 

245.0 

2240 

236.6 

2512 

7.4 

2449 

7.7 

d 

2534 

1090.7 

2550 

906.8 

2606 

7.9 

2501 

8.0 

a 

1408 

1646.1 

1329 

1594.5 

1368 

13.1 

1358 

13.3 

b 

223 

55.7 

216 

55.5 

212 

2.3 

219 

2.3 

SSLP.15.45.15 

c 

2676 

580.6 

2718 

611.0 

2791 

19.0 

2785 

18.9 

d 

2986 

359.1 

2994 

404.1 

3038 

22.3 

3024 

23.0 

e 

207 

30.1 

235 

30.2 

221 

1.9 

232 

1.9 

a 

498 

186.4 

469 

181.4 

506 

4.0 

523 

4.1 

b 

351 

87.2 

335 

87.5 

341 

7.6 

331 

7.6 

SSLP.15.45.20 

c 

380 

196.8 

358 

193.4 

380 

5.1 

387 

5.2 

d 

552 

873.0 

548 

898.1 

560 

20.7 

562 

20.9 

e 

697 

448.4 

697 

378.5 

715 

2.8 

709 

2.8 

a 

658 

554.1 

629 

532.0 

662 

18.4 

633 

18.5 

b 

671 

324.7 

620 

435.1 

670 

9.0 

680 

6.7 

SSLP.15.45.25 

c 

433 

165.2 

399 

160.7 

447 

11.8 

422 

11.9 

d 

965 

435.2 

946 

465.7 

967 

26.9 

1001 

32.3 

e 

938 

2062.7 

900 

1931.0 

943 

17.6 

925 

17.8 

Table  6:  Stochastic  server  location:  overall  results  per  instance. 
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Instance 

Rep. 

#LP 

#MIP 

Std-Std 
Time  LP 

Time  MIP 

#LP 

#MIP 

Alt-Std 
Time  LP 

Time  MIP 

a 

189 

189 

2.2 

55.5 

185 

3 

2.1 

1.7 

b 

154 

154 

1.6 

86.2 

166 

3 

1.6 

1.6 

SSLP.10.50.50 

c 

113 

113 

1.6 

74.5 

109 

4 

1.6 

2.3 

d 

44 

44 

0.6 

21.3 

45 

2 

0.6 

0.9 

e 

238 

238 

2.8 

91.6 

238 

5 

2.7 

2.6 

a 

181 

181 

4.3 

98.4 

183 

6 

3.9 

6.4 

b 

176 

176 

4.0 

69.7 

175 

2 

3.7 

0.9 

SSLP.10.50.100 

c 

112 

112 

3.1 

86.2 

109 

5 

2.9 

4.1 

d 

51 

51 

1.3 

40.9 

50 

2 

1.2 

2.2 

e 

138 

138 

3.7 

109.8 

137 

4 

3.3 

4.0 

a 

178 

178 

21.1 

549.8 

179 

2 

19.1 

11.0 

b 

152 

152 

17.8 

428.5 

150 

2 

15.2 

7.4 

SSLP.10.50.500 

c 

89 

89 

13.7 

523.9 

89 

4 

12.0 

18.6 

d 

56 

56 

8.1 

303.3 

56 

2 

8.1 

12.4 

e 

183 

183 

21.1 

680.4 

179 

5 

19.8 

26.7 

a 

188 

188 

46.4 

1429.6 

185 

3 

41.9 

29.4 

b 

163 

163 

36.6 

1028.5 

156 

2 

32.4 

21.2 

SSLP.10.50.1000 

c 

106 

106 

29.6 

1410.1 

95 

3 

25.8 

30.2 

d 

56 

56 

16.0 

665.2 

55 

2 

15.3 

27.3 

e 

147 

147 

39.4 

1433.1 

145 

5 

35.5 

56.1 

a 

184 

184 

92.6 

3548.0 

181 

5 

82.4 

169.9 

b 

158 

158 

70.3 

2352.7 

156 

2 

65.4 

44.2 

SSLP.10.50.2000 

c 

98 

98 

60.7 

2980.4 

103 

5 

56.6 

167.0 

d 

59 

59 

34.2 

1746.0 

58 

2 

31.2 

62.1 

e 

214 

214 

104.3 

4785.4 

219 

7 

101.0 

222.7 

a 

28 

28 

0.1 

10.9 

28 

2 

0.1 

0.2 

b 

42 

42 

0.1 

2.5 

41 

4 

0.1 

0.2 

SSLP.15.45.5 

c 

481 

481 

1.0 

318.3 

496 

17 

0.9 

7.7 

d 

154 

154 

0.3 

55.2 

141 

4 

0.3 

0.6 

e 

10 

10 

0.0 

16.1 

10 

2 

0.0 

0.7 

a 

93 

93 

0.3 

77.8 

90 

2 

0.3 

1.6 

b 

68 

68 

0.2 

188.2 

67 

5 

0.2 

5.4 

SSLP.15.45.10 

c 

501 

501 

2.2 

240.1 

538 

9 

2.3 

2.9 

d 

386 

386 

1.7 

1086.8 

379 

5 

1.7 

4.4 

a 

263 

263 

1.6 

1642.3 

262 

4 

1.5 

9.7 

b 

41 

41 

0.2 

54.4 

39 

2 

0.2 

1.0 

SSLP.15.45.15 

c 

623 

623 

4.3 

572.8 

645 

16 

4.4 

11.8 

d 

597 

597 

3.3 

352.0 

613 

6 

3.1 

16.2 

e 

29 

29 

0.2 

29.0 

28 

2 

0.2 

0.8 

a 

134 

134 

0.9 

183.7 

132 

2 

0.9 

1.4 

b 

63 

63 

0.4 

85.1 

61 

2 

0.4 

5.6 

SSLP.15.45.20 

c 

61 

61 

0.4 

195.2 

60 

4 

0.4 

3.6 

d 

148 

148 

1.1 

870.2 

145 

6 

1.0 

18.0 

e 

91 

91 

0.7 

446.2 

94 

2 

0.7 

0.7 

a 

156 

156 

1.3 

550.0 

147 

4 

1.2 

14.4 

b 

135 

135 

1.3 

321.0 

148 

4 

1.4 

5.3 

SSLP.15.45.25 

c 

73 

73 

0.6 

162.1 

74 

4 

0.6 

8.8 

d 

213 

213 

2.2 

430.0 

215 

7 

2.1 

21.9 

e 

235 

235 

2.0 

2058.5 

231 

8 

1.9 

13.7 

Table  7:  Stochastic  server  location:  subproblems  details  per  instance. 
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Stochastic  multiple  binary  knapsack  problem 


Instance 

Std-Std 

Std-CGLP 

Alt-Std 

Alt-CGLP 

Nodes 

Time 

Nodes 

Time 

Nodes 

Time 

Nodes 

Time 

1 

27705 

26.4 

27615 

27.0 

27837 

25.5 

26295 

24.9 

2 

63528 

41.1 

55448 

38.6 

65213 

41.1 

57170 

38.2 

3 

93185 

59.9 

87560 

60.2 

101121 

57.8 

81480 

50.9 

4 

137303 

101.1 

121687 

97.3 

132782 

89.1 

127963 

89.1 

5 

224063 

107.2 

183462 

94.1 

244755 

112.1 

251017 

128.3 

6 

363405 

186.6 

290404 

177.5 

355962 

190.1 

258420 

165.5 

7 

503998 

245.8 

401809 

204.4 

517313 

250.4 

397677 

200.2 

8 

436738 

267.5 

310136 

218.0 

431356 

249.3 

334569 

214.8 

9 

470356 

273.3 

451931 

269.7 

502174 

280.5 

450104 

254.8 

10 

507120 

315.6 

320672 

251.1 

518329 

333.1 

342582 

257.5 

11 

623424 

379.4 

675292 

404.9 

637749 

422.5 

615580 

342.6 

12 

887595 

468.7 

672117 

422.7 

954211 

502.8 

741931 

436.2 

13 

1099397 

541.0 

1024147 

692.2 

1172464 

579.1 

984003 

527.4 

14 

1416129 

686.6 

880154 

516.9 

1484427 

711.5 

1057895 

600.3 

15 

1650580 

714.4 

1120524 

509.8 

1692521 

726.8 

1148229 

516.0 

16 

1322774 

749.9 

832266 

533.7 

1013473 

572.2 

956447 

579.2 

17 

1197577 

771.1 

900476 

652.4 

1192205 

753.2 

974525 

686.0 

18 

1230166 

836.7 

988346 

769.7 

1166769 

811.6 

978490 

745.9 

19 

2189204 

1158.0 

1618305 

950.0 

222539.3 

1160.4 

1713778 

962.0 

20 

2395096 

1460.9 

1663945 

1142.5 

2383548 

1404.2 

1756720 

1109.5 

21 

3277812 

1488.2 

2789613 

1328.8 

3563188 

1603.1 

3144784 

1499.3 

22 

2702878 

1664.7 

2244862 

1422.6 

2816341 

1714.0 

2087732 

1430.1 

23 

2309196 

1825.3 

1919811 

1711.6 

2306792 

1715.5 

1833302 

1520.5 

24 

3301135 

1998.1 

2690441 

1771.6 

3101311 

1816.8 

2580654 

1620.4 

25 

3346788 

2310.7 

2987190 

2149.9 

3541754 

2346.8 

2998747 

2068.1 

26 

3024670 

2319.8 

2966064 

2373.0 

3087399 

2258.1 

2806757 

2172.3 

27 

3890594 

2344.4 

3225433 

2099.7 

3787260 

2210.1 

3128508 

1980.4 

28 

4762714 

3223.2 

3253202 

2425.3 

4449516 

2890.2 

3285741 

2311.3 

29 

5717652 

3589.2 

3840063 

2795.1 

5819471 

3597.8 

3885068 

2645.9 

Table  8:  Stochastic  multiple  knapsack:  overall  results  per  instance. 


24 

DISTRIBUTION  A:  Distribution  approved  for  public  release. 


Instance 

#LP 

#MIP 

Std-Std 
Time  LP 

Time  MIP 

#LP 

#MIP 

Std-CGLP 
Time  LP 

Time  MIP 

1 

9 

9 

0.0 

14.4 

9 

9 

0.0 

14.7 

2 

9 

9 

0.0 

15.7 

9 

9 

0.0 

15.8 

3 

14 

14 

0.0 

24.7 

14 

14 

0.0 

26.0 

4 

24 

24 

0.0 

46.2 

24 

24 

0.0 

45.9 

5 

12 

12 

0.0 

21.0 

12 

12 

0.0 

19.8 

6 

14 

14 

0.0 

32.1 

20 

20 

0.0 

46.2 

7 

10 

10 

0.0 

17.3 

10 

10 

0.0 

17.2 

8 

40 

40 

0.1 

80.3 

40 

40 

0.1 

80.7 

9 

34 

34 

0.1 

74.5 

36 

36 

0.1 

74.6 

10 

46 

46 

0.1 

97.9 

49 

49 

0.1 

102.1 

11 

46 

46 

0.1 

77.6 

47 

47 

0.1 

78.9 

12 

45 

45 

0.1 

108.5 

51 

51 

0.1 

123.0 

13 

45 

45 

0.1 

87.2 

87 

87 

0.2 

160.9 

14 

51 

51 

0.1 

124.4 

51 

51 

0.1 

123.9 

15 

22 

22 

0.0 

29.9 

26 

26 

0.1 

36.2 

16 

79 

79 

0.2 

128.8 

74 

74 

0.2 

119.3 

17 

80 

80 

0.2 

168.0 

81 

81 

0.2 

167.5 

18 

92 

92 

0.2 

194.7 

97 

97 

0.2 

202.1 

19 

66 

66 

0.1 

134.3 

65 

65 

0.1 

131.9 

20 

97 

97 

0.2 

193.0 

98 

98 

0.2 

193.9 

21 

49 

49 

0.1 

99.8 

48 

48 

0.1 

97.7 

22 

93 

93 

0.2 

245.2 

91 

91 

0.2 

237.2 

23 

175 

175 

0.4 

341.7 

176 

176 

0.4 

339.1 

24 

89 

89 

0.2 

211.2 

92 

92 

0.2 

221.1 

25 

127 

127 

0.3 

222.2 

127 

127 

0.3 

221.8 

26 

155 

155 

0.3 

331.5 

157 

157 

0.3 

331.4 

27 

103 

103 

0.2 

246.0 

111 

111 

0.2 

264.1 

28 

150 

150 

0.3 

263.7 

152 

152 

0.3 

268.0 

29 

131 

131 

0.3 

259.4 

147 

147 

0.3 

287.6 

Table  9:  Stochastic  multiple  knapsack:  subproblems  details  per  instance. 


Instance 

\v\ 

Update 

Generate 

1 

9 

0.0 

0.2 

2 

9 

0.0 

0.2 

3 

14 

0.0 

0.4 

4 

24 

0.0 

1.0 

5 

12 

0.0 

0.3 

6 

20 

0.0 

0.8 

7 

10 

0.0 

0.2 

8 

40 

0.1 

2.7 

9 

36 

0.1 

2.5 

10 

49 

0.1 

3.4 

11 

47 

0.1 

4.7 

12 

51 

0.1 

4.5 

13 

87 

0.2 

13.2 

14 

51 

0.1 

5.3 

15 

26 

0.1 

1.3 

16 

74 

0.2 

11.9 

17 

81 

0.3 

15.7 

18 

97 

0.4 

21.2 

19 

65 

0.2 

6.9 

20 

98 

0.3 

21.7 

21 

48 

0.1 

4.0 

22 

91 

0.3 

18.8 

23 

176 

0.9 

41.2 

24 

92 

0.3 

13.8 

25 

127 

0.4 

29.8 

26 

157 

0.6 

37.0 

27 

111 

0.4 

25.1 

28 

152 

0.6 

39.6 

29 

147 

0.6 

33.3 

Table  10:  Stochastic  multiple  knapsack:  CGLP  overhead  per  instance. 
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Abstract 

It  is  well-known  that  the  intersection  of  the  matching  polytope  with  a  cardinality  constraint  is  integral  [8]. 
In  this  note,  we  prove  a  similar  result  for  the  polytope  corresponding  to  the  transportation  problem  with 
market  choice  (TPMC)  (introduced  in  [4])  when  the  demands  are  in  the  set  {1,2}.  This  result  generalizes 
the  result  regarding  the  matching  polytope.  The  result  in  this  note  uses  the  fact  that  some  special  classes  of 
minimum  weight  perfect  matching  problem  with  a  cardinality  constraint  on  a  subset  of  edges  can  be  solved 
in  polynomial  time. 

Keywords:  Transportation  problem  with  market  choice,  cardinality  constraint,  integral  polytope 


1.  Introduction  and  Main  Result 

1.1.  Transportation  Problem  with  Market  Choice 

The  transportation  problem  with  market  choice  (TPMC),  introduced  in  the  paper  [4],  is  a  transportation 
problem  in  which  suppliers  with  limited  capacities  have  a  choice  of  which  demands  (markets)  to  satisfy.  If  a 
market  is  selected,  then  its  demand  must  be  satisfied  fully  through  shipments  from  the  suppliers.  If  a  market 
is  rejected,  then  the  corresponding  potential  revenue  is  lost.  The  objective  is  to  minimize  the  total  cost  of 
shipping  and  lost  revenues.  See  [5,  7,  9]  for  approximation  algorithms  and  heuristics  for  several  other  supply 
chain  planning  and  logistics  problems  with  market  choice. 

Formally,  we  are  given  a  set  of  supply  and  demand  nodes  that  form  a  bipartite  graph  G  =  (V)  U  V2,  E). 
The  nodes  in  set  Vj  represent  the  supply  nodes,  where  for  i  £  Vi,  s*  £  N  represents  the  capacity  of  supplier 
i.  The  nodes  in  set  V2  represent  the  potential  markets,  where  for  j  £  V2,  dj  £  N  represents  the  demand  of 
market  j.  The  edges  between  supply  and  demand  nodes  have  weights  that  represent  shipping  costs  we,  where 
e  £  E.  For  each  j  £  V 2,  Vj  is  the  revenue  lost  if  the  market  j  is  rejected.  Let  xuj\  be  the  amount  of  demand 
of  market  j  satisfied  by  supplier  i  for  {i,j}  £  E ,  and  let  Zj  be  an  indicator  variable  taking  a  value  1  if  market 
j  is  rejected  and  0  otherwise.  A  mixed-integer  programming  (MIP)  formulation  of  the  problem  is  given  where 
the  objective  is  to  minimize  the  transportation  costs  and  the  lost  revenues  due  to  unchosen  markets: 


min  \  wexe  +  N  VjZj 

,ze{0,l}lv2l  e£E  jev2 

(1) 

S.t.  X{i,j}  —  dj(  1  —  Zj) 

Vj  €  V2 

(2) 

i:{i,j}eE 

'P.  —  Si 

Vi  £  Vi. 

(3) 

j:{i,j}eE 
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We  refer  to  the  formulation  (l)-(3)  as  TPMC.  The  first  set  of  constraints  (2)  ensures  that  if  market  j  £  V2 
is  selected  (i.e.,  z3  =  0),  then  its  demand  must  be  fully  satisfied.  The  second  set  of  constraints  (3)  model  the 
supply  restrictions. 

TPMC  is  strongly  NP-complete  in  general  [4].  Aardal  and  Le  Bodic  [1]  give  polynomial-time  reduc¬ 
tions  from  this  problem  to  the  capacitated  facility  location  problem  [6],  thereby  establishing  approximation 
algorithms  with  constant  factors  for  the  metric  case  and  a  logarithmic  factor  for  the  general  case. 

1.2.  TMPC  with  dj  £  {1,2}  for  all  j  £  Vi  and  the  Matching  Polytope 

When  dj  £  {1,2}  for  each  demand  node  j  £  Vi,  TPMC  is  polynomially  solvable  [4].  This  is  proven  through 
a  reduction  to  a  minimum  weight  perfect  matching  problem  on  a  general  (non-bipartite)  graph  G'  =  (Vr ,  E')\ 
see  [4].  We  call  this  special  class  of  the  problem,  the  simple  TPMC  problem  in  the  rest  of  this  note. 

Observation  1  (Simple  TPMC  generalizes  Matching  on  General  Graphs).  The  matching  problem  can  be 
seen  as  a  special  case  of  the  simple  TPMC  problem.  Let  G  =  ( V ,  E)  be  a  graph  with  n  vertices  and  m  edges. 
We  construct  a  bipartite  graph  G  =  ( V 1  U  V2,E)  as  follows:  V1  is  a  set  of  n  vertices  corresponding  to  the  n 
vertices  in  G,  and  V2  corresponds  to  the  set  of  edges  of  G,  i.e.,  V2  contains  m  vertices.  We  use  {i,j}  to  refer 
to  the  vertex  in  V2  corresponding  to  the  edge  {i,j}  in  E.  The  set  of  edges  in  E  are  of  the  form  {i,  {i,  j}}  and 
{ 7,  {*,  ?}}  for  every  i,  j  £  V  such  that  U,  j}  £  E.  Now  we  can  construct  (the  feasible  region  of)  an  instance 
of  TPMC  with  respect  to  G  =  ( V 1  U  V2,E)  as  follows: 

Q  =  {(*,  z)  £  R2m  x  Rm  |  x{i>e}  +  xae}+2 ze  =  2  Ve  =  {i,jj  £  V2  (4) 

X!  <  1  V*  G  V1  (5) 

ze  £  {0, 1}  Ve  G  V2}.  (6) 

Clearly  there  is  a  bijection  between  the  set  of  matchings  in  G  and  the  set  of  solutions  in  Q.  Moreover,  let 

H  :=  {(*,  y)  £  R2m  x  T  x  Rm  |  (x,  z)  £  Q,  y  =  e  -  z}, 

where  e  is  the  all  ones  vector  in  Rm.  Then  we  have  that  the  incidence  vector  of  all  the  matchings  inG  =  (V,  E) 
is  precisely  the  set  proj y(H). 

Note  that  the  instances  of  the  form  of  (4)- (6)  are  special  cases  of  simple  TPMC  instances,  since  in  these 
instances  all  Si ’s  are  restricted  to  be  exactly  1  and  all  dj ’s  are  restricted  to  be  exactly  2. 

1.3.  Simple  TPMC  with  Cardinality  Constraint:  Main  Result 

An  important  and  natural  constraint  that  one  may  add  to  the  TPMC  problem  is  that  of  a  service  level, 

that  is  the  number  of  rejected  markets  is  restricted  to  be  at  most  k.  This  restriction  can  be  modelled  using 

a  cardinality  constraint,  Ylj^v2zj  —  appended  to  (l)-(3).  We  call  the  resulting  problem  cardinality- 
constrained  TPMC  (CCTPMC).  If  we  are  able  to  solve  CCTPMC  in  polynomial-time,  then  we  can  solve 
TPMC  in  polynomial  time  by  solving  CCTPMC  for  all  k  £  {0, . . . ,  \Vi\}.  Since  TPMC  is  NP-hard,  CCTPMC 
is  NP-hard  in  general. 

In  this  note,  we  examine  the  effect  of  appending  a  cardinality  constraint  to  the  simple  TPMC  problem. 

Theorem  1.  Given  an  instance  of  TPMC  with  Vi,  the  set  of  demand  nodes,  and  E,  the  set  of  edges,  let 
X  C  x  { 0 , 1 } I ^ I  be  the  set  of  feasible  solutions  of  this  instance  of  TPMC.  Let  k  £  Z+  and  k  <  \Vi\. 
Let  Xk  :=  conv(A  D  {(x,z)  G  R^'  x  {0,  l}ly2l  |  Zj  <  k}).  If  dj  <  2  for  all  j  G  V2,  then  Xk  = 

conv(X)  n  {(a;,  z)  £  1  x  [0,  l]^  |  Ejey2  zi  <  k) ■ 

Our  proof  of  Theorem  1  is  presented  in  Section  2.  We  note  that  the  result  of  Theorem  1  holds  even  when 
Xk  is  defined  as  conv(X  (~l  {(x,  z)  G  x  {0, 1}IV2|  |  Ej6y2  zj  >  k})  or  conv(X  Cl  {(#,  z)  G  x  {0, 1}^  \ 
^jev2zo  =  k})- 

By  invoking  the  ellipsoid  algorithm  and  the  use  of  Theorem  1  we  obtain  the  following  corollary. 
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Corollary  1.  Cardinality  constrained  simple  TPMC  is  polynomially  solvable. 

We  note  that,  as  a  consequence  of  Theorem  1  (but  also  inherent  in  our  proof),  a  special  class  of  minimum 
weight  perfect  matching  problem  with  a  cardinality  constraint  on  a  subset  of  edges  can  be  solved  in  polynomial 
time:  Simple  TPMC  can  be  reduced  to  a  minimum  weight  perfect  matching  problem  on  a  general  (non- 
bipartite)  graph  G'  =  ( V',E ')  [4].  Therefore,  it  is  possible  to  reduce  CCTPMC  with  dj  <  2  for  all  j  G  Vi 
to  a  minimum  weight  perfect  matching  problem  with  a  cardinality  constraint  on  a  subset  of  edges.  Hence, 
Corollary  1  implies  that  a  special  class  of  minimum  weight  perfect  matching  problems  with  a  cardinality 
constraint  on  a  subset  of  edges  can  be  solved  in  polynomial  time. 

Note  that  the  intersection  of  the  perfect  matching  polytope  with  a  cardinality  constraint  on  a  strict  subset 
of  edges  is  not  always  integral. 

Example  1.  Consider  the  cycle  C4  of  length  4  with  edge  set  E  =  {{1,  2},  {2, 3},  {3, 4},  {1, 4}},  and  the 
cardinality  constraint  Xu  +  £34  =  1.  The  only  perfect  matchings  are  {{1, 2},  {3, 4}}  and  {{1, 4},  {2, 3}}  for 
which  the  cardinality  constraint  has  activity  2  and  0,  respectively.  Thus  the  perfect  matching  polytope  is  a 
line  which  is  intersected  by  the  hyperplane  defined  by  the  cardinality  constraint  in  the  (fractional)  center. 

To  the  best  of  our  knowledge,  the  complexity  status  of  minimum  weight  perfect  matching  problem  on  a 
general  graph  with  a  cardinality  constraint  on  a  subset  of  edges  is  open.  This  can  be  seen  by  observing  that 
if  one  can  solve  minimum  weight  perfect  matching  problem  with  a  cardinality  constraint  on  a  subset  of  edges 
in  polynomial  time,  then  one  can  solve  the  exact  perfect  matching  problem  in  polynomial  time;  see  discussion 
in  the  last  section  in  [2]. 

Finally  we  ask  the  natural  question:  Does  the  statement  of  Theorem  1  hold  when  dj  <  2  does  not  hold 
for  every  j ?  The  next  example  illustrates  that  the  statement  does  not  hold  in  such  case. 

Example  2.  Consider  an  instance  of  TPMC  where  G  =  (V\  U  Vi,  E)  is  a  bipartite  graph  with 

Vl  =  {il,i2,*3)*4,*5)*6})  Vi  =  {jl,h,j3,j4}, 

E  =  {{*1,  jl},  {*2,  J2},  {*3,  h},  {*4,  jl},  {U,ji},  {*5,  ji},  {*5,  ji},  {*6, 33},  {*6,  J4 } } , 

Si  =  l,i  €  Vi,  dj1  =  dj2  =  dj3  =  2,  dj4  =  3. 

For  k  =  2  it  can  be  verified  that  we  obtain  a  non-integer  extreme  point  of  conv(X)  D  {(x,z)  G  x  [0,  l]n  | 
'Yhj—l  zj  —  >  given  by  X{j1j1y  =  X{i2j2y  =  £{13,^3}  =  £{14,^4}  =  X{i4j4y  —  X{j5j2y  —  2J{i5,j4}  —  x{i6,j3}  = 
x{i6,j4\  =  Zi  =  Zi  =  Z3  =  Z4  =  \.  To  see  this,  consider  the  face  defined  by  the  supply  constraints  of  nodes 
{*4,  *5,  *6}  and  observe  that  this  face  has  precisely  two  solutions  having  1  and  3  markets,  respectively. 

Therefore,  Xk  7^  conv(X)  (~l  {(&,  z)  G  R+  x  [0, 1]"  |  Y^j= 1  zj  —  k }  example. 


2.  Proof  of  Theorem  1 

To  prove  Theorem  1  we  use  an  improved  reduction  to  a  minimum  weight  matching  problem  (compared  to 
the  reduction  in  [4])  and  then  use  the  well-known  adjacency  properties  of  the  vertices  of  the  perfect  matching 
polytope.  Since  the  integrality  result  does  not  hold  for  the  perfect  matching  polytope  on  a  general  graph  with 
a  cardinality  constraint  on  any  subset  of  edges,  as  illustrated  in  Example  1,  we  need  to  refine  the  adjacency 
criterion. 

We  begin  with  some  notation.  For  a  graph  G  =  (V,  E)  with  node  set  V  and  edge  set  E,  and  a  node  v  G  V, 
we  denote  by  S(v)  :=  Sg(v)  :=  {e  G  E  \  v  G  e}  the  set  of  edges  incident  to  v.  For  a  vector  x  G  and  a 
subset  F  C  E  of  its  ground  set,  we  define  x(F)  :=  Y^feFxf- 

We  now  describe  the  improved  reduction  to  a  minimum  weight  matching  problem.  Consider  a  simple 
TPMC  instance  on  a  graph  G  =  ( Vi  U  Vi,E)  with  supplies  s  G  demands  d  G  {1,2}  1^1,  edge  weights 

w  G  ,  and  revenues  r  G  Rly2l.  Let  Dk  =  { j  G  V2  \  dj  =  k}  be  the  partitioning  of  Vi  into  two  classes 
corresponding  to  the  demands. 
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Figure  1:  Improved  Reduction  to  a  Matching  Problem 


We  create  the  auxiliary  graph  G*  (see  Figure  1)  with  nodes  V|SU£h  IJZ)-|  UZ).}u£>}  and  edges  EiUE2UFiUF2 
with 


Vf  =  {it  |  i  G  Vi  and  £  €  {1,2, ,  s.J}, 

Di  =  (j  |  j  G  Di}, 

D2  =  {jk  I  j  G  £>2}  for  k  =  1,2, 

Ei  =  {{h,j}  I  {*,  j}  G  E,  £  G  {1,2,.. .  ,Si}  and  j  G  £>1}, 

E2  =  {{ ie,jk }  I  {i,j}  G  E,  £  G  {1, 2, . . .  ,s»},  j  G  £>2  and  k  G  {1,2}}, 

£1  =  {{jJ}  I  j  G  £>1},  and 
E2  =  { {ji ,  J2}  |  j  G  £>2}. 

In  the  construction  every  node  i  G  V  with  supply  s*  is  split  into  s*  identical  nodes  with  intended  supply 
value  of  1.  Furthermore,  to  every  node  j  G  V2  with  demand  1  we  attach  an  edge  with  a  dead  end  j,  and  every 
node  j  G  V2  with  demand  2  is  split  into  nodes  j-\  and  j'2  which  are  connected  by  an  edge.  Note  that  this  is  a 
polynomial  construction,  because  the  supply,  s,;,  is  at  most  2 1 V2 1  for  any  i  G  V\. 

Lemma  1.  Let  X  C  m}^  x  {0, 1}IW  be  the  set  of  feasible  solutions  of  a  simple  TPMC  instance  on  a  graph 
G  =  (Vi  UV2,E)  with  supplies  s  G  and  demands  d  G  {1,  2} I V2 1 .  Let  the  sets  D /.  and  the  auxiliary  graph 
G*  be  defined  as  above. 

Then  P  :=  conv(V)  is  equal  to  the  projection  of  the  face  of  the  matching  polytope  PmatchiG*)  of  G* 

Q  ■■=  {y  G  PmatchiG*)  I  y(5(v 0)  =  1  for  all  v  G  D\  U  D\  U  £>2} 

via  the  map  n  defined  by  x{iJ}  =  Ylt=\  for  {bil  e  E  and  3  e  Di>  x{i,j}  =  YltLi(y{ie,ji} +y{ie,j2})  for 

{i,j}  G  E  and  j  G  D2,  Zj  =  for  j  G  D1,  and  Zj  =  y{juh}  for  j  G  £>2- 

Proof.  We  first  show  n(Q)  C  P.  Let  y  be  a  vertex  of  Q  and  {x,  z )  =  n(y)  be  the  projection. 

Clearly,  for  all  i  £  V 1  we  have  x{Scii ))  =  J2tLi  vidG*  {i())  <  Si,  i.e.,  (x,z)  satisfies  (3).  For  every 
node  j  G  £>1  we  have  x(SG(j))  +  Zj  =  y(SG*(j)  \  {{j,j}})  +  Vyjy  =  2/(fc*(j))  =  1-  Furthermore,  for 
every  node  j  G  £>2  we  have  x(6G(j))  +  2 zj  =  y(6G*(ji)  \  Uii.^}})  +  J/(^G*(ji)  \  {{.7i,  J2}})  +  2 y{juj2}  = 
y(SG*{ji))  “f  y{dG*  (j'2))  =  2.  Hence,  (x,z)  satisfies  (2)  proving  (x,z)  G  conv(V)  since  z  is  binary. 

We  now  show  P  C  n(Q)  for  which  it  suffices  to  consider  only  integer  points  in  P  since  both  polytopes  are 
integral.  Note  that  P  is  integral  since  for  integral  z  the  remaining  system  is  totally  unimodular  with  integral 
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right-hand  side.  Let  (x,z)  £  P  D  (Z^  x  {0,  l}^)  be  an  integral  point  in  P.  For  j  £  Di,  let  e3  £  E  be 
the  unique  edge  with  xuj\  >  0,  and  for  j  £  D2,  let  {ej,  fj}  be  the  set  of  edges  incident  to  j  with  positive 


a;- value.  Observe  that  if  e3  =  fj  holds,  then  xej  =  2,  and  otherwise  xe.  =  Xf.  =  1. 

Construct  a  matching  M  satisfying 

M  =  {{j,j}  |  j  £  £>i  with  zj  =  1}  U  {{ji,  ji}  \  j  £  D2  with  zj  =  1}  (7) 

U  {{it,  j}  |  j  £  Di  and  i  £  e3  with  Zj  =  0}  (8) 

U  {{ie,ji}  |  j  £  D2  and  i  £  ej  with  Zj  =  0}  (9) 

U  {{it,  32}  |  j  £  D2  and  i  £  fj  with  Zj  =  0}  (10) 


choosing  t  in  (8)-(10)  such  that  every  node  it  £  V{  has  at  most  one  incident  edge  in  M.  This  is  possible 
since  for  each  i  £  V\ ,  G*  has  s,;  identical  copies  i\, . . . ,  iSt  and  M  has  to  contain  at  most  x(5a(i))  <  s*  edges 
incident  to  one  of  the  copies  because  x  is  integral. 

We  first  prove  that  M  is  indeed  a  matching.  A  node  j  £  D 1  is  matched  either  to  j  (if  Zj  =  1)  or  by  Cj. 
Similarly,  either  j\  and  j2  are  matched  by  the  edge  {ji ,j2}  (again  if  z3  =  1)  or  by  e7-  and  fj,  respectively. 

The  fact  that  M  projects  to  (x,z)  is  easy  to  check  by  the  construction  of  M  according  to  (7)-(10).  This 
concludes  the  proof.  □ 

We  now  turn  to  the  proof  of  Theorem  1.  By  definition  of  the  projection  map  7 r  in  Lemma  1,  the  equation 
z(V 2)  =  k  corresponds  to  the  equation  y(F\  U  F2)  =  k  in  Q,  that  is, 

P  n  {{x,  z)  £  K^1  x  [0, 1]|V21  I  z(V2)  =  k}  =  {7 T(y)  I  y  £  Q  with  y{F1  U  Ff)  =  k} 

holds.  Hence,  in  order  to  show  that  the  former  is  integral  (and  since  7r  projects  integral  vectors  to  integral 
vectors),  it  suffices  to  prove  the  following  claim: 

Claim  1.  Let  X  C  x  {0, 1}  1 2 1  be  the  set  of  feasible  solutions  of  a  simple  TPMC  instance  on  a  graph 
G  =  (Vi  U  V2,  E)  with  supplies  s  £  and  demands  d  £  { 1 ,  2 } I ^ I .  Let  the  sets  D *.  and  the  auxiliary  graph 
G*  be  defined  as  above  and  let  Q  be  as  in  Lemma  1. 

Then  {y  £  Q  \  y{F\  U  F2)  =  k}  is  an  integral  polytope  for  any  integer  k  £  Z+. 

Proof.  Let  H  =  {y  \  y(Fi  U  F2)  =  k}  denote  the  intersecting  hyperplane  and  assume,  for  the  sake  of 
contradiction,  that  Q  (~l  H  is  not  integral.  Then  there  must  exist  two  adjacent  (in  Q)  matchings  Mi  and 
M2  defining  an  edge  of  Q  that  is  intersected  by  H  in  its  relative  interior,  i.e.,  \M1  fl  (Fi  U  F2)|  <  k  and 

\m2  n  (Fi  u  f2)\  >  k. 

By  the  adjacency  characterization  of  the  matching  polytope  [3],  the  symmetric  difference  C  :=  M\AM2 
must  be  a  connected  component  (a  cycle  or  a  path)  in  G*  containing  edges  of  Mi  and  M2  in  an  alternating 
fashion. 

We  now  verify  that  there  must  exist  a  path  e-P-f  in  C  of  odd  length  consisting  of  two  edges  e,  f  £ 
C  n  ( Fi  U  F2)  and  a  subpath  P  in  C  \  ( F\  U  F2 ):  If  for  every  choice  of  e,  f  £  M2  fl  C  fl  (F\  U  F2)  there  exists 
an  edge  belonging  to  Mi  fl  ( Fi  U  F2)  in  all  subpath(s)  e-P-f  of  C,  then  M2  can  have  at  most  one  more  edge 
of  Fi  U  F2  than  Mi  in  C.  However  since  M2  contains  at  least  two  more  edges  of  F\  U  F2  than  Mi  does,  we 
have  that  there  exists  a  path  e-P-f  in  C  consisting  of  two  edges  e,  /  £  M2  fl  C  fl  {F\  U  F2)  and  a  subpath  P 
in  C  \  ( Fi  U  F2).  Now  since  e-P-f  is  subpath  of  C  and  e,  /  £  M2,  we  have  that  P  is  of  odd  length. 

Clearly,  since  we  have  P  fl  ( F\  U  F2)  =  0,  all  of  P’s  edges  must  go  between  Vjs  and  Di  U  (D\  U  D\ ).  Since 
P  also  has  odd  length,  one  of  its  endpoints  is  in  Vf .  But  no  edge  in  Pi  U  F2  is  incident  to  any  node  in  Vj,s 
which  yields  a  contradiction.  □ 
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